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and  show  how  the  causal  strength  of  a  variable  depends  on  it  being  a  devi¬ 
ation  or  difference  in  a  background.  This  has  implications  for  under¬ 
standing  causes  vs.  conditions/  the  differential  salience  of  explanations, 
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and  molar  vs.  molecular  explanations;  (2)  We  discuss  various  cues-to- 
causality  such  m  covariation,  temporal  order,  contiguity  in  time  and 
space,  and  similarity  of  cause  and  effect.  In  doing  so,  we  show  how  these 
f  cues  can  conflict  with  probabilistic  ideas.  A  model  for  combining  the 
background  and  cues  in  determining  the  strength  of  an  explanation  is 
proposed  that  includes  the  construction  of  a  causal  chain  between  an 
effect  and  its  presumed  cause.  The  model  is  used  to  discuss  a  wide 
range  of  studies  on  causal  judgments  and  explicates  methodological  issues 
such  as  spurious  correlation,  "causalation, M  and  causal  inference  in  case 
studies;  (3)  the  discounting  of  an  explanation  by  specific  alternatives 
is  discussed  ps  a  special  case  of  the  sequential  updating  of  beliefs. 

This  leads  to  a  consideration  of  various  factors  that  have  not  previously 
been  considered  important  in  the  discounting  process.  Finally,  we  eictend 
our  approach  to  consider  multiple  causation;  in  particular,  conjunctive 
explanations. 
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To  provide  a  framework  for  reviewing  the  relevant  literature  on  how 
judgments  of  probable  cause  are  Bade,  we  draw  upon  several  approaches  to  the 
study  of  causality;  specifically,  work  in  attribution  theory  (Haider,  1958; 
Kelley,  1973),  methodology  (Cook  S  Campbell,  1979),  and  philosophy  (Mackie, 
1974).  We  stress  that  each  of  these  areas  has  been  treated  in  detail  else¬ 
where  and  we  make  no  claim  to  providing  exhaustive  treatments  here.  Rather, 
the  focus  is  on  linking  causal  inference  with  research  on  judgments-under- 
uncertainty. 

General  approach 

We  assume  that  judgments  of  probable  cause  are  based  on  the  judged 
strength  of  the  causal  link  between  some  suspected  cause,  X,  and  some  effect, 
Y.  Moreover,  we  argue  that  the  strength  of  a  causal  link  is  a  function  of 
three  factors:  (1)  a  causal  background  or  context;  (2)  various  cues-to- 
causality  (such  as  temporal  order,  covariation,  contiguity,  and  similarity  of 
cause  and  effect);  and,  (3)  a  mechanism  for  discounting  causal  strength  when 
specific  alternatives  are  considered.  To  illustrate,  imagine  that  a  watch 
face  has  been  hit  by  a  hammer  and  the  glass  breaks.  How  likely  was  the  force 
of  the  hammer  the  cause  of  the  breakage?  Since  no  explicit  context  is  given, 
an  implicitly  assumed  neutral  context  is  invoked  in  which  the  cues-to- 
causality  point  strongly  to  a  causal  relation;  i.e.,  the  force  of  the  hammer 
precedes  the  breakage  in  time,  there  is  high  covariation  between  glass 
breaking  (or  not)  with  the  force  of  solid  objects,  contiguity  in  time  and 
space  is  high,  and  there  is  similarity  in  the  length  and  strength  of  cause 
and  effect.  Moreover,  it  is  difficult  to  discount  the  causal  link  since  there 
are  few  specific  alternatives  to  consider.  How  imagine  that  the  same  events 
occurred  in  a  watch  factory.  In  this  context,  the  cause  of  the  breakage  is 
more  often  judged  to  be  a  defect-in-the-glass  (Einhom  s  Hogarth,  1982a).  We 
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How  do  people  judge  the  likelihood  that  variable  X  causes  outcome  Y? 

The  complexity  of  the  answer  to  this  question  is  attested  to  by  the  size  and 
diversity  of  the  literature  on  causality  in  a  wide  variety  of  disciplines; 
e.g.,  philosophy  (Bunge,  1979  and  references),  law  (Hart  6  Honor e,  1959; 
Fincham  &  Jaspars,  1980),  medicine  (Shapiro,  1960;  Susser,  1973),  statistics 
(Suppes,  1970;  Cohen,  1977;  Toda,  1977),  psychology  (Michotte,  1946;  Piaget, 
1974).  From  our  perspective,  much  of  the  difficulty  in  assessing  causality  is 
due  to  the  fact  that  judgments  of  causation  are  typically  judgments  of 
probable  cause;  i.e.,  people  make  causal  inferences  which  take  place  under 
uncertainty.  Indeed,  many  practical  inferences  involve  the  combining  of 
causal  and  uncertainty  judgments.  For  example,  predictive  judgments  are 
typically  influenced  by  one's  causal  model  of  how  particular  variables  produce 
the  outcome  of  Interest;  diagnostic  judgments  are  concerned  with  determining 
the  likely  cause  of  outcomes,  events,  symptoms,  and  the  like;  attributions  of 
blame  and  responsibility  rest  on  judging  who  (or  what)  was  the  likely  cause; 
and  so  on.  (Xir  basic  contention  is  that  uncertainty  and  causality  are 
intertwined  in  a  great  many  situations  and  people  use  strategies  in  making 
causal  inferences  that  reflect  both  factors.  Indeed,  it  is  the  combination  of 
these  two  modes  of  thought  that  is  the  focus  of  this  paper.  However,  as  often 
happens  when  different  systems  are  combined,  conflict  and  inconsistency  can 
occur.  In  particular,  conflict  between  causal  and  probabilistic  reasoning  can 
lead  to  judgments  of  probable  cause  that  violate  probability  and  statistical 
theory.  We  therefore  consider  such  cases  because  they  highlight  the  various 
characteristics  of  each  system  and  the  difficulty  of  integrating  one  with  the 


other 


net  strength  of  evidence  that  X  causes  Y.  More  formally,  denote  SJc(X, Y)  as 
the  net  strength  of  X  for  Y  after  considering  k  specific  alternatives  (k 
■  0,  1,  K).  Furthermore,  let  s(X,Y)  be  the  gross  strength  of  X  for 

Y;  i.e. ,  its  plausibility  or  strength  before  specific  alternatives  are  con¬ 
sidered  (note  that  SQ(X,Y)  -  s(x,Y)).  The  gross  strength  of  X  for  Y  is 
determined  by  the  causal  background  and  the  cues-to-causality,  as  will  be 
discussed  in  detail  below.  Furthermore,  denote  Z^  as  the  kth  alternative 
causal  explanation  and  let  sfZ^Y)  be  the  gross  strength  of  Z^.  The  general 
model  is  then  given  by, 

Sfc(X,Y)  -  F[s(X,Y),  s(Zk,Y)]  (1) 

where,  S^lX,Y)  is  increasing  in  s(X,Y)  and  decreasing  in  s(Z^,Y).  Thus, 
the  net  strength  of  X  for  Y  increases  with  the  gross  strength  of  X,  and 
decreases  with  the  strength  or  plausibility  of  competing  alternatives. 

To  illustrate,  reconsider  the  initial  watch-hammer  scenario  and  contrast 
the  net  strength  of  the  "force  of  the  hammer”  explanation  with  the  net 
strength  of  any  single  explanation  for  the  following  questions: 

1.  Why  are  the  outer  rings  of  Saturn  braided? 

2.  Why  was  Ronald  Reagan  elected  President  in  1980? 

For  the  first  question,  it  is  difficult  to  generate  a  single  explanation, 
thus  suggesting  its  gross  strength  is  low.  However,  although  there  are  few 
competing  explanations,  net  strength  remains  low.  For  the  second  question, 
there  are  many  strong  explanations  (e.g.,  the  situation  of  the  economy;  the 
rise  of  the  moral  majority;  the  unresolved  Iranian  hostage  problem;  etc. ) . 
Therefore,  while  the  gross  strength  of  these  are  high,  the  net  strength  of 
any  single  one  is  low  precisely  because  the  others  are  highly  plausible 
alternatives.  On  the  other  hand,  the  watch-hammer  question  leads  to  high 
net  strength  since  the  explanation  is  strong  and  there  are  few  plausible 


alternatives,  in  short,  it  is  argued  that  like  good  patterns,  good 
explanations  have  few  alternatives  (Garner,  1970);  or,  to  be  more  precise, 
whereas  good  explanations  inply  few  strong  alternatives,  the  lack  of  strong 
alternatives  does  not  imply  good  explanations. 

While  equation  (1)  is  too  general  to  be  useful  in  an  operational  sense, 
it  shows  that  an  understanding  of  judgments  of  probable  cause  requires: 

(a)  a  model  of  how  the  gross  strength  of  an  explanation  is  determined;  and 

(b)  a  model  of  how  the  gross  strength  of  an  explanation  is  discounted  by 
alternatives. 

Plan  of  the  paper 

We  organize  our  discussion  and  review  around  the  various  issues  raised  by 
equation  (1).  Specifically,  we  first  discuss  the  concept  of  a  causal 
background  and  show  how  the  causal  strength  of  a  variable  depends  on  it  being 
a  deviation  or  difference  in  a  background.  Second,  we  discuss  the  causal  cues 
of  covariation,  temporal  order,  contiguity  in  time  and  space,  and  similarity 
of  cause  and  effect.  In  doing  so,  we  show  how  these  cues  can  conflict  with 
probabilistic  ideas.  Third,  a  model  for  how  the  background  and  cues  combine 
in  determining  the  gross  strength  of  an  explanation  is  presented.  The  model 
highlights  the  constraints  and  conflicts  between  cues  and  provides  a  framework 
for  integrating  much  research  in  causal  inference.  Fourth,  a  model  for 
discounting  the  gross  strength  of  an  explanation  on  the  basis  of  alternatives 
is  given.  The  model  serves  to  illustrate  the  types  of  problems  that  must  be 
addressed  in  the  discounting  process.  Fifth,  our  general  approach  is  extended 
to  deal  with  multiple  causation;  in  particular,  conjunctive  explanations. 
Finally,  various  descriptive  and  normative  Implications  of  our  framework  are 


discussed 


Aa  causal  background 

Much  research  indicates  that  processes  of  perception  and  judgment  are 
sensitive  to  differences  or  deviations  fron  present  states,  adaptation  levels, 
and  reference  points  (e.g.,  Helson,  1964;  Kahneman  s  Tversky,  1979).  More¬ 
over,  this  sensitivity  also  applies  to  the  types  of  events  that  trigger  the 
asking  of  causal  questions.  Specifically,  we  believe  that  events  of  causal 
interest  are  those  that  are  unusual,  abnormal,  or  unlikely  (cf.  Has tie,  1984). 
Thus,  one  rarely  seeks  the  cause  of  why  one  feels  "average, "  why  traffic 
flowed  normally,  or  why  some  accident  is  typical.  To  be  sure,  the  need  for 
explanations  can  be  aroused  vis-a-vis  normal  events.  However,  this  is  most 
likely  to  happen  when  those  events  violate  expectations  and  are  therefore  seen 
as  unusual  after  all.  For  example,  we  might  want  to  know  why  traffic  flowed 
normally  if  major  highway  improvements  were  just  couple ted,  or  why  we  feel 
"average"  after  hearing  about  a  death  in  the  family.  Therefore,  causal 
inference  is  often  invoked  to  make  sense  of  various  kinds  of  deviations. 
However,  it  is  important  to  note  that  the  meaning  of  a  deviation  is  itself 
crucially  dependent  on  some  assumed  background  or  field.  Indeed,  even 
averages  can  be  made  unusual  with  the  appropriate  shift  of  background— 
consider  Oscar  Hilda's  statement  that  "moderation  shouldn’t  be  taken  to 
extremes." 

He  now  consider  how  the  background  affects  judgments  of  probable  cause 
and  discuss  two  related  issues:  (1)  the  role  of  the  background  in  distin¬ 
guishing  causes  from  conditions;  (2)  how  the  background  makes  alternative 
causes  more  or  less  salient. 

(1 )  It  has  been  argued  that  causal  relevance  is  generally  related  to  the 
degree  that  a  variable  is  a  difference-in-a-background  (Mackie,  1974).  By 
this  is  meant  that  factors  that  are  part  of  some  presumed  background  are 


judged  to  b*  of  little  or  no  causal  relevance.  For  example,  does  birth  cause 
death?  While  the  former  is  both  necessary  and  sufficient  for  the  latter  (and 
thus  covaries  perfectly  with  it),  it  seems  odd  to  consider  one  the  cause  of 
the  other.  The  reason  is  that  death  presupposes  that  one  has  been  born. 
Therefore,  birth  is  part  of  the  background  and  its  causal  relevance  is  low. 
However,  not  all  factors  that  are  differences-in-a-background  are  seen  as 
causally  relevant.  In  particular,  one  must  distinguish  between  causes  and 
conditions.  To  illustrate  the  distinction,  consider  diseases  that  are 
generally  confined  to  particular  segments  of  the  population  (e.g. ,  children, 
or  the  elderly).  It  seems  peculiar,  for  example,  to  say  that  Alzheimer's 
disease  is  caused  by  being  old.  Rather,  being  old  suggests  various  enabling 
factors  that,  when  conjoined  with  some  other  event(s)  (e.g.,  virus,  bacteria, 
etc.),  produces  the  disease.  Indeed,  being  old  may  be  a  necessary  condition 
for  the  disease,  but  not  its  cause.  As  we  argue  below,  the  distinction 
between  particular  causes  and  conditions  is  highly  dependent  on  some  assumed 
background.  However,  causes  can  generally  be  distinguished  from  conditions  in 
that:  (a)  Events  are  more  causal  than  standing  conditions  (viruses  are  more 

causal  than  being  old);  (b)  Intrusive  events  are  more  causal  than  events  that 
generally  occur;  (c)  Something  abnormal  or  wrong  is  more  causal  than  what  is 
normal  and  right  (e.g.,  the  car  accident  was  caused  by  the  person  veering  to 
the  left,  not  by  the  other  person  who  drove  straight  ahead). 

How  consider  how  a  shift  in  background  can  change  a  probable  cause  into  a 
condition,  with  a  corresponding  reduction  in  its  causal  strength.  Recall  our 
earlier  example  of  a  watch-face  being  hit  by  a  hammer.  In  the  original  con¬ 
text,  a  hammer  that  hits  glass  is  an  intrusive  event  that  does  not  generally 
occur  while  a  defect-in-the-glass  is  a  standing  state.  However,  in  a  watch 
factory,  the  hitting  of  glass  by  hammers  generally  occurs  while  a  defect  is 


something  abnormal  or  wrong.  Hence,  in  this  context,  the  hitting  of  glass 
becomes  more  of  a  condition  or  standing  state  and  the  explanation  loses  ouch 
of  its  causal  strength.  Similarly,  the  defect  explanation  gains  in  causal 
strength  due  to  its  abnormality  in  these  circumstances.  However,  it  is 
possible  to  decrease  the  relative  causal  strength  of  the  defect  explanation  by 
making  it  less  abnormal.  For  example,  consider  that  all  of  the  watch  faces 
were  being  broken  in  the  testing  procedure.  This  suggests  that  the  force  of 
the  hammer  was  too  great  (it  being  inappropriate  for  a  testing  procedure) , 
and  its  causal  strength  is  thereby  increased. 

The  importance  of  background  shifts  on  causes  vs.  conditions  is  not 
limited  to  physical  causation.  Indeed,  consider  the  work  on  actor  vs. 
observer  differences  in  attribution  theory  (Jones  &  Nisbett,  1972;  Watson, 

1982  for  a  recent  review).  The  basic  finding  is  that  whereas  observers  tend 
to  attribute  the  causes  of  an  actor's  behavior  to  the  dispositional  traits  of 
the  actor,  actors  tend  to  attribute  the  causes  of  their  own  behavior  to 
situational  variables.  Note  that  from  the  actor's  perspective,  his/her  own 
traits  are  standing  states,  while  situations  are  more  intrusive  and  hence  more 
causal.  From  the  observer's  perspective,  situations  are  seen  as  standing 
states  and  the  dispositions  of  actors  are  more  intrusive.  In  fact,  this 
asymmetry  was  noted  by  Jones  and  Nisbett  (1972)  who  stated,  "For  the  observer 
behavior  is  figural  against  the  ground  of  the  situation.  For  the  actor  it  is 
the  situational  cues  that  are  figural  and  are  seen  to  elicit  behavior”  (p. 

93).  In  our  analysis,  therefore,  much  should  depend  on  the  familiarity  of 
observers  with  particular  actors  (e.g.,  if  person  A  is  well  known  to  the 
observer,  A's  dispositions  may  be  seen  as  standing  states),  as  well  as  the 
actor's  familiarity  with  particular  situations.  And  indeed,  although  we  know 
of  no  studies  that  have  addressed  the  latter  prediction,  some  evidence  exists 


supporting  the  former  (Goldberg,  1981,  but  see  Watson,  1982).  Moreover,  a 
shift  in  background  does  not  mean  that  previous  causes  lose  all  their  strength 
or  that  prior  conditions  are  now  seen  as  strong  causal  candidates.  Clearly, 
changes  in  causal  strength  are  one  of  degree  and  relative  effects  can  vary 
considerably.  In  his  review,  Watson  (1982)  notes  that  dispositions  generally 
receive  more  weight  than  situations  for  both  actors  and  observers,  but  that 
the  relative  weight  for  dispositions  is  greater  for  observers  than  for  actors. 

the  influence  of  background  shifts  also  clarifies  a  perennial  issue  in 
scientific  inquiry — reductionists  in  causal  explanations  (causes  at  a  molar 
level  are  different  from  those  at  a  molecular  level).  If  one  thinks  of  the 
background  as  analogous  to  the  field  of  vision  under  a  microscope,  then  shifts 
in  magnification  of  the  lens  define  different  fields.  Moreover,  since  causal 
relevance  is  at  least  a  difference-in-the-field,  it  is  obvious  that  a  cause  at 
one  level  will  not  necessarily  be  relevant  at  another.  This  microscope 
analogy  makes  clear  that  the  "appropriate"  level  of  magnification  depends  on 
one's  purposes  and  the  extent  of  one’s  knowledge  of  the  phenomenon  in 
question.  Thus,  a  biochemist  may  see  the  causal  link  between  smoking  and  lung 
cancer  as  due  to  chemical  effects  of  tar,  nicotine,  and  the  like,  on  cell 
structure,  while  an  immunologist  might  see  the  causal  link  as  due  to  the 
suppression  of  the  immune  system  in  controlling  diseases  in  general.  However, 
it  should  be  noted  that  the  level  of  the  field  is  not  totally  arbitrary  in 
everyday  inferences.  Indeed,  there  is  remarkable  consensus  among  individuals 
as  to  the  appropriate  level  of  the  assumed  background.  On  the  other  hand, 
where  large  discrepancies  exist  in  knowledge  about  a  particular  topic,  as  in 
comparing  experts  to  non-experts,  such  consensus  is  often  lacking. 

(2)  In  addition  to  defining  differences/deviations  and  distinguishing 
between  causes  and  conditions,  the  causal  background  plays  an  important  role 


in  affecting  the  number  and  salience  of  alternative  explanations.  That  is,  if 
one  considers  the  causal  background  as  containing  the  set  of  alternatives 
against  which  a  particular  causal  candidate  is  evaluated,  then  shifts  in  the 
background  can  highlight,  or  de-emphasize,  classes  *  alternatives.  For 
example,  imagine  the  following  scenario:  Joe  is  a  chemical  worker  who 
contracts  lung  cancer  and  sues  the  company  for  causing  his  disease.  His 
lawyer  argues  that  the  cancer  rate  of  workers  in  this  factory  is  nine  times 
the  national  average  for  workers  in  comparable  industries.  Note  that  the 
background  in  this  argument  is  industries  of  a  certain  type  and  the  causal 
argument  rests  on  there  being  a  difference  (higher  cancer  rates)  in  this 
background.  However,  the  lawyers  for  the  chemical  company  may  shift  the 
background  by  arguing  that  Joe  has  smoked  cigarettes  for  years,  comes  from  a 
family  with  a  history  of  cancer,  and  so  on.  Hote  that  the  background  is  now 
changed  to  people  with  certain  personal  habits  and  characteristics  thereby 
making  these  habits  and  characteristics  highly  salient  alternatives.  More¬ 
over,  the  background  shift  also  changes  Y  from  an  unusual  event  that  requires 
a  special  causal  explanation  (working  in  a  chemical  plant)  to  a  less  unusual 
event  that  requires  no  such  explanation. 

In  addition  to  shifts  in  the  background,  alternatives  can  be  ruled  in  or 
out  depending  on  the  breadth  of  the  background.  In  the  above  scenario,  for 
example,  note  how  Joe's  case  would  be  strengthened  if  it  could  be  shown  that 
the  cancer  rate  in  his  factory  was  nine  times  the  rate  of  other  chemical 
factories  making  exactly  the  same  product.  The  reason  is  that  by  narrowing 
the  field  to  chemical  plants  making  the  same  product,  the  number  of  alterna¬ 
tives  explanations  is  reduced,  thereby  making  the  difference  in  the  narrowed 
field  more  causally  relevant.  A  similar  idea  has  been  advanced  by  Bar-Hillel 
(1980).  In  considering  the  research  showing  that  people  ignore  base  rates  in 


■sking  probability  judgment*,  she  demonstrates  that  base  rates  will  be  used  if 
they  are  made  more  specific  or  if  they  can  be  given  a  causal  interpretation. 
She  suggests  that  both  specificity  and  a  causal  interpretation  increase 
the  "relevance”  of  information,  and  thus  its  use.  From  our  perspective, 
Bar-Hillel's  treatment  of  relevance  is  consistent  with  our  view  of  probable 
cause;  both  are  increased  by  specific  causal  evidence  and  a  narrowing  of  the 
background  (specificity)  which  reduces  alternatives. 

Cues-to-causality 

Once  a  particular  background  or  context  has  been  evoked,  the  strength  of 
a  causal  link  between  X  and  Y  depends  on  various  cues-to-causality.  We  note 
that  the  term  "cues”  has  a  specific  meaning  that  corresponds  with  its  use  in 
Brunswik's  psychology  (1952;  also  see  Hammond,  1955;  Campbell,  1966).  There¬ 
fore:  (1 )  The  relation  between  each  cue  and  causality  is  probabilistic.  That 

is,  each  cue  is  only  a  fallible  sign  of  a  causal  relation;  (2)  People  learn 
to  use  multiple  cues  in  making  inferences  in  order  to  mitigate  against  the 
potential  errors  arising  from  the  use  of  single  cues;  (3)  The  use  of  multiple 
cues  is  often  facilitated  by  the  intercorrelation  (redundancy)  between  them  in 
the  environment.  This  both  reduces  the  negative  effects  of  omitting  cues,  and 
aids  in  directing  attention  to  the  presence  of  others;  (4)  Although  multiple 
cues  can  reduce  uncertainty  in  causal  inference,  conflicting  cues  can  increase 

it.  As  we  shall  see,  conflicting  cues  often  require  that  distinctions  be  made 
with  respect  to  the  type  of  causal  relation  present  (e.g.,  precipitating  vs. 
underlying  cause) .  We  consider  these  issues  later,  in  connection  with  a  model 
for  combining  the  cues. 

In  discussing  the  cues,  we  consider  each  at  some  molar  level  although  we 
are  fully  aware  that  they  can  be  decomposed  into  more  molecular  components. 

For  example,  contiguity  can  be  decocposed  into  temporal  and  spatial  cues  and 
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temporal  contiguity  can  be  further  divided  Into  the  time  Interval  between 
cause  and  effect  and  the  regularity  of  the  Interval.  The  Importance  of 
considering  the  elements  of  each  cue  will  become  apparent  as  we  proceed. 

The  cue  of  covariation  is  considered  first  since  it  has  received 
considerable  attention  by  numerous  authors  (Kelley,  1973;  Nisbett  &  Ross, 

1980;  Crodcer,  1981;  Alloy  S  Tabachnik,  1984).  We  begin  by  noting  that  when 
people  judge  that  X  caused  Y  (or  X  will  cause  Y),  they  rarely  mean  that  X  is 
either  necessary  or  sufficient  for  Y.  For  example,  if  a  fire  inspector  said 
that  a  short-circuit  (X)  caused  a  house  fire  (Y),  it  is  clear  that  x  is 
neither  necessary  nor  sufficient  for  Y.  Rather,  it  means  that  X,  when 
conjoined  with  a  particular  set  of  conditions,  leads  to  Y  (e.g.,  flammable 
material  near  the  short  circuit,  no  sprinkler  system,  etc. ) .  If  we  denote  W 
as  representing  flammable  material  near  the  short-circuit  and  Z  as  the  absence 
of  a  sprinkler  system,  then  define  a  as  the  scenario  in  which  X  and  w  and  Z 
occurred;  i.e.,  a  «  (xDwnz|B)  where,  B  -  causal  background.  After  Mackie 
(1965),  we  call  a  a  minimally  sufficient  condition  for  Y  if. 


p(Y|a)  -  1  but,  p(o|Y)  *  0,1  (2) 

Thus,  given  a,  Y  always  follows;  however,  given  Y,  a  is  neither  certain 
nor  impossible.  The  scenario  a  is  said  to  be  minimally  sufficient  if  the 
deletion  of  any  of  its  conjuncts  makes  a  no  longer  sufficient.  Furthermore, 
it  is  important  to  note  that  the  sufficiency  of  a  always  presupposes  some 
causal  background,  otherwise  the  number  of  conjuncts  in  a  would  be  very 
large.  For  example,  it  is  assumed  that  oxygen  is  part  of  the  causal  field 
since  there  can  be  no  fires  without  its  presence.  Indeed,  since  oxygen  is  a 
standing  state,  it  can  be  seen  as  a  necessary  condition  and  thus  part  of  the 
background  in  this  context. 

If  we  new  consider  the  short-circuit  as  the  probable  cause  of  Y,  note 
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that  X  is  nscassary  f or  the  scenario  a  but  it  is  not  sufficient;  i.e. , 

p(xja)  -  1  but,  P(ajx)  *  0,1  (3) 

He  can  now  say  that  X  is  an  JLnsuf f icient  but  ^necessary  part  of  a  complex 
scenario  (a),  which  is  itself  unnecessary  but  sufficient  for  Y  (INOS  for 
short) .  Mackie  notes  that  the  statement,  "the  short-circuit  caused  the  house 
fire,"  implies  that;  (a)  X  is  at  least  an  INUS  for  Y;  (b)  X  occurred;  (c)  the 
other  conjuncts  occurred;  (d)  all  minimally  sufficient  conditions  for  Y  not 
having  X  in  them  were  absent  on  the  occasion  in  question. 

The  above  conceptualization  provides  a  useful  and  important  link  between 
probable  causes  and  covariation.  Consider  a  2  x  2  table  in  which  X  and  x  are 


Insert  Table  1  about  here 


crossed  with  Y  and  Y.  The  upper  left-hand  cell  (cell  JO  contains  the  number 
(and  frequency)  of  scenarios  (“yfix^  *n  wh^-C*'  x  conjoins  with  various  con¬ 
ditions  to  cause  Y.  The  upper  right  hand  cell  (cell  B)  contains  scenarios  in 
which  X  occurs  but  the  remaining  conditions  are  insufficient  for  Y  (a-^) . 
This  can  result  if  there  are  too  few  enabling  conditions  to  allow  Y's  occur¬ 
rence;  or,  conditions  exist  that  cancel  or  prevent  X  from  causing  Y.  For 
example,  imagine  that  X  was  a  particular  virus,  Y  was  some  disease,  and  w  was 
an  antibody  that  existed  in  some  proportion  of  the  population.  If  w 
counteracts  X,  then  the  conjunction  (xflw)  does  not  produce  the  disease.  He 
denote  the  frequency  of  a-^  scenarios  as  indicating  the  "conditionality  of 
causation."  That  is,  the  larger  this  cell,  the  more  X  is  either  sensitive  to 
cancellation  or  the  more  it  has  to  conjoin  with  other  conditions  to  produce  Y. 
Now  consider  the  lower  left-hand  cell  (cell  C),  which  represents  the 

scenarios  a.  -  .  This  cell  contains  the  minimally  sufficient  conditions  for 

Ynx 

Y  in  which  X  is  absent  and  thus  indicates  the  existence  of  alternative  causes 


.v.v.v 


YY-Y 

•Y-Y-Y 


;.\.v.--yy-yyy  y-y-y >Y-y->yy-> Y-Y-Y-  .*•  Y-  Y-Y-Y-Y-Y 


Effect 


Y 


Y 


°Ynx 

“yHx 

conditionality 

(A) 

(B) 

“yOx 

“yHx 

multiplicity 

(C) 

(D) 

of  Y.  To  contrast  this  source  of  uncertainty  with  the  "conditionality  of 
causation, "  we  consider  the  frequency  of  a  -  scenarios  as  representing  the 

X|  |A 

"Multiplicity  of  causation."  Finally,  the  lower  right-hand  cell  contains 
those  scenarios  in  which  neither  X  nor  V  occurs,  ct^-  .  This  cell  is 
problematic  since  it  implicitly  defines  the  context  in  which  x  and  Y  are  being 
considered.  For  example,  consider  the  relation  between  smoking  (X)  and  lung 
cancer  (Y).  Cell  A  contains  the  minimally  sufficient  scenarios  for  Y  in  which 
X  occurs;  Cell  B  contains  insufficient  scenarios  for  Y  in  which  X  occurs;  Cell 
C  contains  sufficient  scenarios  for  Y  that  do  not  contain  X;  Cell  D  contains 
the  scenarios  in  which  no  smoking  conjoins  with  other  conditions  to  lead  to  no 
lung  cancer.  In  this  latter  cell,  questions  arise  as  to  what  objects  to 
include  as  having  X  and  Y.  For  example,  should  we  allow  children  to  be 
included  (after  all,  they  don't  smoke  and  don't  have  lung  cancer)?;  if  not,  at 
what  age  are  we  to  define  the  relevant  population?;  Should  the  analysis  be 
different  for  men  vs.  women;  people  with  a  history  of  cancer  in  the  family  vs. 
those  without,  etc.?  What  is  important  to  note  is  that  covariation,  by  its 
very  nature,  implies  a  reference  population  and  thus  some  background  against 
which  the  relation  should  be  assessed. 

The  above  conceptualization  has  several  general  implications.  First,  the 
traditional  notions  of  necessity  and  sufficiency  of  causation  can  be  repre¬ 
sented  by  covariation  in  that  a  necessary  cause  is  one  that  is  in  all 
minimally  sufficient  scenarios  for  Y  (Implying  that  the  C  cell  id  empty);  a 
sufficient  cause  is  one  that  always  conjoins  with  conditions  to  produce  Y 
(implying  that  the  B  cell  is  empty).  Second,  the  idea  of  considering 
scenarios  of  events  as  conprising  the  "raw  data"  for  judging  covariation  is 
consistent  with  the  use  of  this  cue  in  singular  cases;  i.e.,  where  the  data 
consists  of  a  single  instance.  For  example,  imagine  that  you  feel  ill  today 


and  recall  that  you  at#  fiah  last  night.  Since  there  Is  only  one  observation, 
covariation  sight  be  thought  irrelevant.  However,  one  can  imagine  (or 
remember)  scenarios  involving  illness  without  eating  fish;  and,  one  can 
imagine  (or  remember)  scenarios  in  which  a  fish  meal  produces  (or  produced)  no 
ill  effects.  Furthermore,  even  if  one's  imagination  or  experience  is  limited, 
one  is  aware  of  the  uncertainties  associated  with  both  the  multiplicity  and 
conditionality  of  causation.  Shat  is,  one  is  aware  of  one's  incomplete 
knowledge  regarding:  (a)  all  the  conditions  that  can  conjoin  with  X  to  produce 
Y;  and,  (b)  the  alternatives  that  can  cause  Y  in  the  absence  of  X.  Put 
succinctly,  people  know  that  they  don't  have  complete  causal  knowledge. 
Therefore,  the  awareness  of  incomplete  knowledge  provides  the  basis  for  why 
causal  judgments  are  generally  judgments  of  probable  cause. 

He  now  consider  some  specific  implications  of  a  scenario-based  approach 
to  causation  for  discussing  the  literature  on  judgments  of  covariation.  To 
begin,  we  conceive  of  covariation  judgments  (denoted  as  Q1 )  as  resulting  from 
a  weighted  linear  combination  of  the  cell  frequencies  from  a  2  x  2  table. 

Thus, 
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where,  q,  -  o^,  q2  -  a^X;  q3  -  q4  -  o^j,  0t  -  weight  given  to 

Si¬ 
tuation  (4)  provides  a  simple  and  convenient  way  of  summarizing  much  of 
the  research  on  covariation  judgments.  For  example,  Smedslund  (1963)  and 
Jenkins  and  Hard  (1965)  showed  that  their  subjects'  judgments  were  based 
almost  exclusively  on  o_  (i.e.,  a  >0,  0,-0,  «  a  -  0);  Ward  and 

Jenkins  (1965),  however,  changed  the  way  information  was  presented  to  subjects 
(from  sequential  to  intact  displays),  and  found  different  patterns  of  use 


(many  subjects  Ignored  disconf irming  evidence,  i.e.,  0  •  0  ■  0;  many  other 

subjects  weighted  all  cells);  Binhorn  and  Hogarth  (1978)  noted  that  informa¬ 
tion  is  frequently  absent  froa  real-world  tasks  such  that  0^ ,  &2  >  0  but 

8  -  8  -  0.  Since  such  of  the  research  on  covariation  judgments  compares 
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intuitive  responses  with  some  statistical  measure  of  covariation  (usually  the 
correlation  coefficient— in  which  the  four  cells  are  equally  weighted),  the 
ignoring  of  certain  cells  is  often  taken  as  evidence  that  people  have  a  poor 
intuitive  concept  of  correlation.  However,  numerous  studies  have  also  shown 
that  people  can  and  do  make  use  of  all  of  the  q^'s  that  are  available  (Alloy 
&  Abramson,  1979;  Crocker,  1981).  A  recent  meta-analysis  speaks  directly  to 
this  issue.  Lipe  (1982)  has  shown  that  when  subjects'  responses  were 
regressed  onto  the  four  data  cells  over  a  number  of  studies,  the  weights  for 
all  four  q^'s  were  significant  and  in  the  expected  directions  (31  ,04  >  0; 
0^,0 .  <  0).  Therefore,  the  question  is  not  whether  people  sometimes  ignore 
relevant  information  but  rather,  under  what  conditions  is  particular  infor¬ 
mation  ignored  or  not?  For  example,  Crocker  (1982)  has  demonstrated  that  the 
type  of  question  asked  can  greatly  affect  attention,  and  thus  the  amount  of 
weight  given  to  each  cell  (also  see,  Arkes  fi  Harkness,  1983). 

We  believe  that  when  people  are  asked  to  judge  the  covariation  between 
two  variables,  they  may  Interpret  the  question  in  a  causal  way  (the  tendency 
to  equate  correlation  with  causation  is  very  strong;  otherwise,  why  warn 
students  who  take  statistics  courses  that  correlation  does  not  imply 
causation?).  Moreover,  if  questions  are  thought  to  reflect  causal  relations, 
one  would  expect  greater  use  of  all  four  cells  since  the  uncertainties 
(unknown  multiplicity  and  conditionality)  of  a  causal  relation  are  more 
salient  than  in  covariation  per  se.  In  fact,  Schustack  and  Sternberg  (1981) 
showed  that  when  subjects  were  given  information  in  the  form  of  the  q^'s  and 


were  asked  to  give  causal  judgments,  a  regression  model  like  equation  (4) 

yielded  results  showing  that  8,  S  >  0  and  3  ,  3  <  0.  In  another  study 
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(Waller  s  Felix,  1982),  subjects  were  asked  to  judge  the  same  information  by 
answering  both  a  causal  and  a  correlational  question.  In  accord  with  our  view 
that  covariation  is  a  fallible  cue  to  causality,  they  found  a  moderate  but 
significant  correlation  between  the  two  types  of  judgments  (r  -  .57) . 

If  causal  rather  than  covariation  questions  direct  attention  to  more 
' s,  one  implication  is  that  the  judged  causal  strength  of  two  variables 
should  be  closer  to  statistical  measures  of  correlation  than  are  judgments  of 
covariation.  However,  we  know  of  no  research  that  speaks  to  this  issue.  On 
the  other  hand,  causal  judgments  based  on  covariation  data  are  prey  to 
attentional  shifts  in  their  own  right.  Specifically,  causal  questions  can  be 
asked  in  a  future  or  forward  direction  (how  likely  will  X  cause  r?),  a  past  or 
backward  direction  (how  likely  was  V  caused  by  X?),  or  a  temporally  neutral 
manner  (how  likely  are  X  and  Y  causally  related?).  In  forward-causal- 
inference,  the  focus  of  attention  is  on  the  conditionality  of  causation  (cells 
A  and  B)  since  the  uncertainties  about  the  conditions  that  enable  X  to  produce 
Y  are  made  salient.  Indeed,  Schu stack  and  Sternberg  (1981)  asked  subjects  to 
make  forward-causal-inferences  and  found  that  the  regression  weight  for  cell  B 
was  higher  than  for  cells  C  and  0  (cell  A  had  the  highest  weight;  all  weights 
were  significantly  different  from  zero).  In  backward-causal-inference,  the 
focus  of  uncertainty  is  on  the  unknown  multiplicity  of  causation  (cell  C).  In 
fact,  some  philosophers  have  implicitly  assumed  a  backward  inference  direction 
in  discussing  causation.  Thus,  Mackie  (1974)  notes  that  in  assessing  whether 
X  caused  Y,  one  implicitly  asks  the  counter-factual  question,  "Would  Y  have 
occurred  if  X  had  not?"  Note  how  this  focuses  attention  on  the  multiplicity 
of  causation  by  asking  one  to  imagine  minimally  sufficient  conditions  for  Y 
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that  do  not  contain  X  (i.e.,  a^-) . 

Th«  idea  that  people  can  differentially  weight  the  q^'s  due  to 
attentlonal  shifts  is  an  important  yet  virtually  unresearched  area  in  judging 
probable  cause.  In  particular,  the  notion  that  the  direction  of  inference 
(forward/backward /neutral)  can  affect  the  relative  sizes  of  the  fS^'s  seems 
especially  promising.  Such  research,  for  example,  should  be  able  to  resolve 
the  issue  of  whether  "necessity"  or  "sufficiency"  of  causation  is  more 
strongly  related  to  judged  causal  strength.  According  to  our  conceptual 
scheme,  necessity  should  be  most  important  in  backward  inference  (would  Y  have 
occurred  if  X  had  not?),  but  sufficiency  should  be  most  important  in  forward 
inference  (could  Y  occur  if  X  does?).  When  causal  questions  are  asked  in  a 
temporally  neutral  way,  both  necessity  and  sufficiency  are  likely  to  play  a 
role  (i.e.,  cells  B  and  C  should  be  more  equally  weighted). 

In  addition  to  the  direction  of  inference,  another  factor  is  likely  to 
affect  causal  judgments  via  the  covariation  cue.  We  indicated  above  that  cell 
0  (a^-)  is  often  problematic  in  that  the  definition  of  what  constitutes  the 

absence  of  X  and  Y  depends  on  some  assumed  background  or  reference  population. 
Consistent  with  this  is  the  fact  that  call  0  is  rarely  weighted  as  heavily  as 
the  other  three  cells.  However,  there  is  a  sense  in  which  this  cell  is  impor¬ 
tant  in  that  it  raises  the  issue  of  the  robustness  of  the  X,Y  relation  over 
other  backgrounds  or  populations.  For  example,  imagine  a  high  correlation 
between  a  particular  diet  and  heart  disease  in  the  O.S.,  but  a  lack  of  any 
such  correlation  in  Western  Europe.  We  would  expect  that  in  judging  the  causal 
strength  of  X  and  Y,  robustness  will  be  seen  as  an  important  cue-to-causality. 
Indeed,  an  interesting  and  unresearched  question  concerns  whether,  or  to  what 
degree,  people  are  willing  to  trade-off  high  covariation  (in  one  setting)  with 
lower  covariation  that  is  robust  over  several  backgrounds. 
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tnporal  order,  contiguity,  and  similarity 

(to  new  consider  the  cues  of  ts mporal  order,  contiguity  in  tine  and  space, 
and  similarity  of  causa  and  effect.  Temporal  order  (denoted  Q2)  is  dis¬ 
cussed  first.  The  importance  of  temporal  order  in  causal  judgments  seams 
obvious  since  it  is  essential  for  determining  which  of  two  variables  in  a 
relation  is  causa  and  which  is  effect.  However,  whereas  temporal  order 
greatly  affects  causal  judgments,  this  cue  has  no  role  in  formal  probability 
theory.  This  suggests  that  there  is  likely  to  be  much  conflict  and  confusion 
in  probability  judgments  when  temporal  order  is  salient.  We  illustrate  this 
conflict  by  considering  three  issues:  the  confusion  of  joint  and  conditional 
probabilities,  causal  vs.  diagnostic  probabilities,  and  the  confusion  of 
inverse  probabilities  (Dawes,  undated). 

An  important  feature  of  temporal  order  is  the  speed  and  facility  with 
which  it  is  used— often  without  explicit  awareness.  This  is  particularly  the 
case  in  the  interpretation  of  language;  consider  the  conjunction  "and, ■  which 
frequently  implies  temporal  order  in  everyday  English  (Strawson,  1952). 
However,  the  word  "and"  is  used  in  formal  probability  to  denote  a  conjunction 
of  events,  irrespective  of  temporal  order.  Therefore,  how  should  one  under¬ 
stand  a  question  like,  "What  is  the  probability  of  going  into  the  supermarket 
and  buying  some  coffee?"  If  we  let  S  -  going  into  the  supermarket,  and  K  “ 
buying  coffee,  the  probability  of  S  and  K  could  be  interpreted  in  several 
ways.  Whereas  a  statistician  would  represent  the  question  as  p(SflK)  and 
ignore  the  temporal  meaning  of  "and,"  others  may  perceive  the  question  as 
requiring  the  conditional  probability,  p(K|s).  That  is,  one  could  consider 
the  sequence  of  events  as  being  important  and  condition  the  uncertainty  of  the 
later  event  (K)  on  the  earlier  one  (S).  Indeed,  to  direct  attention  to  the 
conjunction  of  the  events,  it  might  be  helpful  to  reverse  S  and  K  in  order 


to  break  tha  implied  time  order,  i.a. ,  "What  ia  the  probability  of  buying  some 
coffee  (K)  and  going  into  tha  supermarket  (s)?"  An  experiment  to  test  this 
assertion  was  performed  using  graduate  students  with  at  least  one  statistics 
course.  One  group  (n^  -  24)  was  asked  to  choose  how  they  would  represent  "S 
and  K"  probabilistically,  while  a  second  group  (n2  ”  24)  was  asked  to 
represent  "K  and  s".  Subjects  chose  from  either  p(SflK),  p(k|s),  p(sjx), 
or  "none  of  the  above.”  The  results  showed  an  increase  for  p(Sf|K)  when  the 
time  order  was  reversed  (58%  to  75%).  Of  further  interest  was  the  finding 
that  38%  of  the  subjects  chose  p(x|s)  in  the  first  group  (in  accord  with  the 
natural  order  of  the  events)  while  no  subjects  chose  p(k|s)  in  the  second 
group.  Clearly,  temporal  order  is  an  important  cue  that  is  difficult  to 
ignore,  even  when  it  may  be  appropriate  to  do  so. 

A  further  example  of  the  importance  of  temporal  order  in  probability 
judgments  has  been  discussed  by  Tversky  and  Xahneman  (1980).  They  argue  that 
people  judge  the  probability  of  data  that  has  causal  significance  as  being 
greater  than  data  that  have  diagnostic  value,  even  when  the  information  is 
equally  informative  from  a  statistical  point  of  view.  For  example,  compare 
the  probabilities  that:  a  girl  has  blue  eyes  (Y)  if  her  mother  has  blue  eyes 
(X)  vs.  a  mother  has  blue  eyes  if  her  daughter  has  blue  eyes.  Although 
p(Y|x)  -  p(x|y),  making  X  and  Y  equally  informative,  many  subjects  judge 
the  causal/forward  probability  as  greater  than  the  diagnostic/backward  one 
(i.e.,  p(Y|x)  >  p(x|y)).  Tversky  and  Xahneman  also  demonstrate  that  when 
temporal  cues  are  ambiguous,  the  causal  interpretation  of  data  tends  to 
receive  disproportionate  weight  such  that  elementary  rules  of  probability  can 
be  violated. 

The  above  suggests  that  temporal  order  has  a  straightforward  effect  on 
probability  judgments.  However,  the  situation  is  more  complex;  in  particular, 


people  often  have  difficulty  in  distinguishing  between  p(y|x)  and  p(x| Y) 
when  X  and  Y  are  not  equally  informative.  'Kits  "confusion  of  inverse 
probabilities"  (Dawes,  undated)  seems  to  contradict  the  importance  of  temporal 
order  as  a  cue  that  unequivocally  leads  to  causal  probabilities  being  judged 
larger  than  diagnostic  ones.  However,  we  now  discuss  when  the  confusion  is 
most  likely  to  occur  and  argue  that  it  is  due  to  the  ambiguity  of  temporal 
cu<^  in  the  task.  To  illustrate,  consider  the  case  of  mammography  and  breast 
cancer  (Eddy,  1982).  For  women  with  breast  cancer  (C),  the  probability  of  a 
positive  mammogram  (M)  is  .79  (p(M|c)  -  .79).  Eddy  reports  that  most 
physicians  misinterpret  the  statements  about  the  test  and  estimate  the 
probability  of  breast  cancer  given  a  positive  test  to  be  approximately  .75 
(p(c|m)  -  .75).  "When  asked  about  this,  the  erring  physicians  usually  report 
that  they  assumed  that  the  probability  of  cancer  given  that  the  patient  has  a 
positive  X-ray  .  .  .  was  approximately  equal  to  the  probability  of  a  positive 
X-ray  in  a  patient  with  cancer  ....  The  latter  probability  is  the  one 
measured  in  clinical  research  programs  and  is  very  familiar,  but  it  is  the 
former  that  is  needed  for  clinical  decision  making.  It  seems  that  many  if  not 
most  physicians  confuse  the  two."  (Eddy,  1982,  p.  254.) 

Under  what  conditions  is  the  confusion  most  likely  to  occur?  From  our 
perspective,  first  note  that  cancer  causes  the  positive  test  result  and  not 
vice-versa.  Therefore,  denote  cancer  as  the  cause  (X)  that  temporally 
precedes  the  test  as  an  effect  (Y).  However,  the  test  result  is  known  first 
in  time  and  it  is  used  to  predict  the  cause,  X.  The  peculiarity  here  is  that 
the  test  predicts  a  prior  state  (cancer)  that  presumably  existed  before  the 
test  and  which  caused  the  test  result  itself.  Hence,  this  situation  has  a 
loop-like  structure  that  makes  the  temporal  order  of  events  particularly 
confusing.  This  loop-like  structure  is  shown  in  Figure  1 . 


Insert  Figure  1  about  here 


Note  that  the  causal  probability,  p(y|x)  is  not  predictive  in  the  usual 
sense;  i.e. ,  while  having  cancer  predicts  the  test,  this  hardly  seems  relevant 
to  the  practical  matter.  On  the  other  hand,  the  test  (which  is  known  first  in 
time),  is  used  to  predict  the  disease  that  preceded  it  (since  the  disease 
causes  the  test  result) .  Therefore,  there  are  two  temporal  cues  in  problems 
of  this  type:  causes  precede  effects,  but,  effects  are  seen  first  and  used  to 
predict  their  prior  causes.  Under  such  conditions,  confusion  of  inverse 
probabilities  is  quite  understandable.  To  make  matters  worse,  in  statistical 
jargon,  the  "diagnosticity"  of  a  datum  (0)  to  some  hypothesis  (H)  refers  to 
P(d|k).  However,  in  ordinary  discourse,  the  diagnostic  value  of  D  for  H 
means  how  well  does  0  predict  H  (i.e.,  p(H|o)).  Therefore,  to  avoid  such 

confusion,  it  would  be  useful  when  discussing  p(d|h)  to  say  that  D  is 
symptomatic  rather  than  diagnostic  of  H. 

The  next  cue  to  be  considered  is  contiguity  (Q3 ) •  The  extent  to  which 
events  are  contiguous  in  time  and  space  is  an  important  cue  to  causality  as 
evidenced  in  the  work  of  Michotte  (1946).  Siegler  has  also  shown  that  for 
young  children  (5-6  years  old) ,  temporal  contiguity  is  a  very  strong  cue  for 
inferring  causality  (Siegler  £  ULebert,  1974;  Siegler,  1976).  Moreover,  these 
studies  show  that  older  children  are  less  dependent  on  contiguity  alone,  being 
able  to  make  use  of  multiple  cues.  Nevertheless,  contiguity  remains  an 
important  cue  for  directing  attention  to  contingencies  between  variables,  and 
such  contingencies  may  then  be  considered  as  to  their  causal  significance. 

When  temporal  and/or  spatial  contiguity  is  low  (or,  temporal  contiguity  is 
erratic),  inferring  causality  becomes  more  difficult.  That  is,  in  the  absence 


of  contiguity,  relations  are  hard  to  develop  unless  one  uses  intermediate 
causal  models  to  link  the  events  (see  the  next  section).  For  instance,  the 
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Figure  I.  Loop-like  structure  for 
confusion  of  inverse  probabilities 
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tenoral  gap  between  intarcoursa  and  birth  raguiras  some  knowledge  of  human 
biology  and  chemistry  to  maintain  the  links  between  those  events. 

While  the  above  seems  obvious,  it  should  first  be  noted  that  the  sign  and 
form  of  the  relation  between  contiguity  and  judgments  of  probable  cause 
depends  on  one's  knowledge  of  a  particular  situation.  For  example,  imagine 
that  one  has  advertised  a  product  and  sales  go  up  dramatically  the  next  day. 

If  one  believes  that  advertising  works  by  a  gradual  diffusion  process,  the 
sales  increase  may  not  be  attributed  to  the  ad  precisely  because  the  events 
occurred  too  close  in  time.  On  the  other  hand,  contiguity  could  be  seen  as 
monotonic  with  causal  strength  by  others  with  different  theories,  or  by  the 
saute  person  in  another  context.  Second,  contiguity  is  particularly  prone  to 
conflict  with  other  cues,  which  can  then  require  congiromises  of  several 
sorts.  While  we  consider  this  in  more  detail  in  the  next  section,  note  how 
the  belief  that  increased  contiguity  in  time  leads  to  greater  causal  strength. 


can  conflict  with  temporal  order  if  cause  and  effect  occur  simultaneously.  If 


temporal  order  has  priority,  the  relation  between  contiguity  and  judgments  of 


probable  cause  is  not  monotonic. 


The  cue  of  similarity  (denoted  g4)  is  important  to  causal  judgments. 


Like  covariation,  similarity  can  be  modeled  as  a  function  of  its  elements, 
some  of  which  add,  and  some  subtract,  from  its  strength.  That  is,  following 
Tversky  (1977),  similarity  judgments  can  be  defined  as  a  weighted  linear 
function  of  the  common  elements  of  two  objects  (cf.  cell  A  in  covariation) 
minus  the  distinctive  elements  of  each  (cf.  cells  B  and  C).  However,  to 
extend  this  conception  of  similarity  from  objects  to  causes  and  effects,  it  is 
necessary  to  specify  the  common  and  distinctive  elements  of  the  latter.  These 
can  be  considered  at  several  levels.  First,  there  is  a  long-standing  notion 
that  cause  and  effect  should  exhibit  some  degree  of  physical  resemblance. 
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Mill  noted  that  this  Is  a  deeply  rooted  belief  that,  "not  only  reigned  supreme 
in  the  ancient  world,  but  still  possesses  almost  undisputed  dominion  over  many 
of  the  most  cultivated  minds”  (cited  in  Nisbett  S  Ross,  1980,  p.  115).  Mill 
thought  that  such  a  belief  was  erroneous  and  many  cases  exist  in  which 
physical  resemblance  has  been  misleading.  For  example,  Nisbett  and  Ross 
(1980)  point  out  that  physical  resemblance  was  the  cornerstone  of  a  medical 
theory  called  the  "doctrine  of  signatures"  whereby  cures  for  diseases  were 
thought  to  be  marked  by  their  resemblance  to  the  symptoms  of  the  disease. 

Thus,  the  curing  of  jaundice  was  attributed  to  a  substance  that  had  a  brillant 
yellow  color  (see  also  Shapiro,  1960;  Shweder,  1977).  However,  whereas 
physical  resemblance  may  be  a  cue  of  low  validity,  it  does  not  mean  it  has  no 
validity.  Indeed,  there  are  many  examples  of  where  it  is  useful. 

At  a  second  level,  one  can  consider  similarity  based  on  the  congrulty  of 
the  length  and  strength  of  cause  and  effect.  That  is,  if  the  effect  of 
interest  is  large  (i.e.,  is  of  substantial  duration  and/or  magnitude),  people 
will  expect  the  cause(s)  to  be  of  cooparable  size.  For  example,  the  germ 
theory  of  disease  advanced  by  Pasteur  must  have  seemed  incredible  to  his 
contemporaries  in  that  people  were  asked  to  believe  that  invisible  creatures 
caused  death,  plagues,  and  so  on.  In  the  same  way,  it  is  equally  difficult 
for  many  to  believe  that  billions  of  dollars  spent  on  social  programs  in  the 
’60s  and  ’70s  could  have  had  little  or  no  effect,  or  that  long  term  and 
complex  effects  like  poverty  can  have  short  term  and  simple  causes. 

Determinants  of  Gross  Strength 

Causal  chains,  while  the  causal  background  and  cues-to-causality  are 
important  in  determining  the  gross  strength  of  an  explanation,  are  other 
factors  involved?  Several  authors  have  argued  that  attributions  of  physical 
causation  depend  on  the  perception  of  a  "generative"  force  that  links  causes 


to  effects  (e.g.,  Shultz,  1982).  This  implies  a  strong  mechanistic  conception 
of  causation  in  which  effects  result  from  a  physical  transmission  of  causal 
"energy”  from  X  to  1>  The  importance  of  this  idea  is  that  it  implies  that 
events  need  to  be  linked  via  a  causal  chain  so  that  the  force  can  be 
transmitted  from  one  link  to  the  next.  Conversely,  if  no  connecting  links  can 
be  made  between  X  and  Y,  the  strength  of  a  causal  explanation  will  be  zero. 

To  illustrate,  how  likely  is  it  that  sun-spots  (X)  cause  price-changes  on  the 
stock  market  (Y)?  In  order  to  link  these  events,  one  needs  a  causal  chain 
that  bridges  the  spatial  and  temporal  gap  (assume  that  temporal  order  and 
covariation  both  point  to  a  causal  relation  and  that  prices  are  affected  six 
months  after  sun-spot  activity).  If  someone  cannot  construct  a  causal  chain 
to  link  X  and  Y,  we  argue  that  this  is  sufficient  for  zero  gross  strength. 

Put  differently,  the  existence  of  a  causal  chain  is  necessary  for  non-zero 
gross  strength  regardless  of  the  levels  of  the  cues.  Therefore,  causal 
relations  must  be  "achieved”  in  the  sense  that  prior  knowledge  and  imagination 
are  needed  to  construct  a  schema,  scenario,  or  chain,  to  link  cause  and 
effect.  On  the  other  hand,  a  scenario  that  does  link  X  and  Y  may  be  so  weak 
as  to  result  in  a  zero  or  near  zero  gross  strength.  For  example,  imagine  the 
following  causal  chains  sun-spots  affect  weather  conditions,  which  affect 
agricultural  production,  which  affect  economic  conditions,  which  affect 
profits,  which  affect  stock  prices.  However,  before  explicating  the 
implausible  nature  of  this  chain,  we  need  to  consider  the  role  cf  the  cues 
and  background  in  the  construction  of  such  chains. 

If  a  causal  chain  is  necessary  for  linking  cause  and  effect,  what  rules 
govern  its  construction?  of  particular  importance  is  the  fact  that  the  cues- 
to-causality  constrain  the  possible  causal  chains  that  can  be  constructed. 

This  is  most  obvious  in  considering  the  cue  of  temporal  order.  That  is,  if  X 
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does  not  temporally  precede  Y,  it  cannot  be  a  cause  of  Y.  However,  the  cues 
of  contiguity  in  time  and  space,  congruity  (i.e.,  similarity  of  length  and 
strength  of  cause  and  effect)  and  covariation  also  constrain  the  linkage 
process.  First,  consider  contiguity  in  time  and  space  and  recall  the  sun¬ 
spots  and  stock  prices  example,  in  order  to  link  these  events  at  all,  it  was 
necessary  to  bridge  the  spatial  and  temporal  gap  by  positing  a  change  in 
weather,  etc.  However,  imagine  that  price  changes  occur  immediately  after 
sun-spot  activity  (rather  than  six  months  later) .  Note  that  the  high 
contiguity  in  time  precludes  the  weather-economic  conditions-scenario  since 
that  scenario  requires  a  time  delay  between  X  and  Y.  Hence,  in  order  to  link 
X  and  Y,  another  scenario  is  necessary  that  is  consistent  with  high  contiguity 
in  time  (and  low  contiguity  in  space).  Similarly,  imagine  taking  up  smoking 
and  getting  lung  cancer  the  next  week.  It  seems  highly  unlikely  that  smoking 
is  the  cause  since  a  causal  chain  with  high  contiguity  in  time  is  not  easy  to 
construct.  Hew  consider  incongruity  of  cause  and  effect;  i.e.,  small 
causes/big  effects  or,  big  causes/small  effects.  To  account  for  incongruity, 
the  linkage  process  must  involve  some  type  of  "amplification”  in  the  first 
instance,  and  "dampening"  in  the  second.  For  example,  re-consider  the 
incongruity  between  germs  and  illness/death,  as  exemplified  by  the  germ  theory 
of  disease,  without  knowledge  of  how  germs  enter  the  body,  multiply, 
disseminate  throughout  the  system,  and  so  on,  it  would  be  difficult  (and  was 
difficult  at  the  time),  to  accept  such  a  theory. 

A  summary  of  the  effects  of  the  cues  on  constraining  the  type  of  causal 
chain  necessary  to  link  X  and  Y  is  given  in  Table  2,  where  the  cues  of 
contiguity,  and  congruity  are  shown  at  each  of  two  levels.  Constrast,  for 

^2S2E§_f25i®l2~about_here 

example,  cells  1  and  4.  In  the  former,  the  two  cues  have  high  values  such 


that  one  would  expect  a  model  requiring  few  links,  if  any.  Recall,  for 
instance,  the  example  of  a  hammer  hitting  a  watch  face  and  causing  the  glass 
to  break.  In  the  latter  (cell  4),  both  cues  are  low  and  therefore  place  quite 
strong  demands  on  the  type  of  causal  chain  that  can  meet  specifications. 

There  is  an  important  connection  between  the  construction  of  a  chain  that 
meets  the  constraints  of  the  cues  and  the  strength  of  that  chain.  To  expli¬ 
cate  the  nature  of  this  relation,  we  first  denote  as  the  strength  of  the 

causal  chain  that  links  X  with  Y.  If  one  cannot  construct  a  linkage  between  X 
and  Y,  -  0.  If  a  causal  chain  can  be  constructed,  then  measures  the 

strength  of  that  chain.  In  order  to  discuss  the  factors  that  determine  Q^, 
recall  the  sun-spot-stock-price  causal  chain  given  above: 


Sun-spots  ♦  weather  +  agriculture  ♦  economy  ♦  profits  ♦  price-changes 
X  ♦  X.  ♦  X_  Y 
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Dote  that  while  this  scenario  meets  the  constraints  in  Table  2,  it  requires  a 
long  chain  of  events  to  bridge  the  temporal  and  spatial  gap  (we  would  argue 
that  congruity  is  not  a  problem  here).  Let  &  be  defined  as  the  number  of 
links  in  the  constructed  chain,  where  l  is  equal  to  the  number  of  inter¬ 
mediate  variables  in  the  chain  (small  x’s)  plus  one.  Thus,  in  the  above  chain 
there  are  five  links,  what  determines  the  strength  of  each  link,  and,  how  do 
the  individual  link-strengths  combine  to  determine  the  strength  of  the  whole 
chain  (i.e.,  Q^)? 

Before  individual  link -strength  can  be  determined,  the  constraints  of 
temporal  order,  contiguity,  and  congruity  must  have  been  met.  Therefore, 
we  argue  that  it  is  the  covariation  between  variables  at  each  link  that 
determines  link-strength.  For  example,  the  link  between  sun-spots  and  weather 
is  due  to  the  covariation  of  X  with  »  the  weather-agricultural  production 
link  is  due  to  the  covariation  of  x.  with  Xti  and  so  on.  Now  consider  how 


the  individual  links  conbina  to  determine  the  strength  of  the  causal  chain, 

Ql*  To  examine  this,  let  Cj  be  the  covariation  of  variables  at  the  jth  link 
in  the  chain  (j  -  1,  2,  . ..,  1)  and  assume  that  the  relation  between  the 
Cj's  and  is  as  follows: 


«L-  *  “j 

3-1  3 


(5) 


By  assuming  a  multiplicative  function,  equation  (5)  implies  the  follow¬ 
ing:  (1)  The  strength  of  a  chain  is  at  best,  equal  to  its  weakest  link. 

Thus,  if  c.|  has  the  lowest  link-strength  in  the  chain,  is  at  best  equal 

to  c1  (if  all  other  links  are  1).  Indeed,  note  that  if  any  c^  -  0,  - 


0  regardless  of  the  strength  of  the  other  links.  Also  note  that  since  c 
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is  between  0  and  1,  the  longer  the  chain  (A),  the  lower  the  strength  of  the 
whole  chain  (since  one  is  multiplying  fractions).  Therefore,  equation  (5) 
accords  well  with  some  basic  intuitions  regarding  the  relations  between  the 
number  and  strength  of  links  in  determining  the  strength  of  the  chain;  (2)  The 
effects  of  covariation  at  the  link-level  and  the  number  of  links  in  a  chain 
can  be  more  easily  understood  by  approximating  equation  (5)  using  the  average 

covariation  between  links,  c,  for  Cj.  Thus, 

/ 

l 


II  c  ■  "c  * 


(6) 
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where. 


c  -  (1/A)  l  c 
3-1  3 


Equation  (6)  makes  clear  why  the  sun-spot-stock-price  scenario  is  implausible 
on  the  basis  of  Since  A  ■  5  in  that  scenario,  even  if  c”  ■  .7, 

is  only  .17.  Therefore,  long  chains  are  generally  weaker  than  short 
ones.  However,  equation  (6)  also  implies  that  if  c  is  high,  long  chains 
can  be  stronger  than  short  chains  with  lower  c.  Indeed,  when  c  »  1,  the 


length  of  the  chain  is  irrelevant.  For  example,  consider  the  following 
scenario: 

Mrs.  Jones,  a  middle-aged  lady,  was  looking  for  her  seat  at  a  foot¬ 
ball  game.  While  trying  to  ease  past  some  other  spectators  on  a 
steep  staircase,  she  lost  her  balance  and  fell  over.  She  hit  another 
spectator,  who  was  also  off  balance.  This  person,  in  turn,  fell  on 
someone  else  who  unwittingly  pushed  Mr.  Smith,  a  70-year  old  man, 
against  soma  iron  railings.  Mr.  Smith  broke  a  leg  in  the  incident. 
How  likely  did  Mrs.  Jones'  fall  cause  Mr.  Smith  to  break  his  leg? 

We  designate  such  scenarios  as  "domino-chains,"  for  obvious  reasons.  Hie 
basic  characteristics  of  these  chains  is  a  high  "c,  high  %,  and  high  QL. 

On  the  other  hand,  when  ~c  is  low,  even  moderate  l  leads  to  a  low  value 
of  Q^j  (3)  Equations  (5)  and  (6)  suggest  that  the  level  of  the  causal  back¬ 
ground  can  also  affect  by  changing  the  number  of  links  in  the  causal 

chain.  TO  see  this,  recall  the  child's  game  of  asking  "why?"  after  each 
answer.  This  game  highlights  the  fact  that  the  links  in  a  causal  chain  are 
subject  to  further  analysis  at  a  more  reductionistic  level.  As  pointed  out 
earlier,  the  causal  background  implies  a  particular  level  of  explanation  and 
as  such,  it  constrains  the  type  and  length  of  the  causal  chain  used  to  connect 
X  with  Y.  In  particular,  as  each  link  is  decomposed  into  further  components, 
the  number  of  total  links  (1)  increases.  However,  unless  the  Cj 's  also 
increase  (which  is  possible  since  the  phenomenon  may  be  better  understood  at 
lower  levels),  could  be  seen  to  decrease.  Clearly,  the  rates  at  which 

t  and  Cj  increase  (if  they  do)  determine  whether  is  higher  or  lower  at 

different  levels  of  the  causal  background. 

A  combining  rule  for  gross  strength.  As  should  be  clear  from  the 
preceding  sections,  judgments  of  gross  strength  are  composed  of  numerous 


factors.  To  recapitulate.  Table  3  lists  these  factors  Including  sons  of  the 
assumed  relations  between  them.  Note  that  we  have  also  specified  the  causal 
background  by  a  variable  Qg  for  reasons  to  be  explicated  below. 

Iiji®i§”f2kle_3_about_here 

Evidence  concerning  the  manner  in  which  the  variables  listed  in  Table  3 
combine  in  affecting  gross  strength  can  be  found  in  many  areas  of  psychology. 
Moreover,  this  evidence  can  be  conceptualized  as  illuminating  two  key  issues: 
(a)  what  are  the  necessary  conditions  for  perceiving  causal  relations?  and  (b) 
How  are  conflicts  between  the  cues  resolved? 

(a)  Three  variables  can  be  considered  necessary;  Qg,  Q2'  and  Ql*  As 
discussed  previously,  if  a  variable  is  not  perceived  as  a  dif ference-in-a- 
field,  it  will  not  be  deemed  causally  relevant.  Similarly,  temporal  order  is 
a  necessary  condition  since  causes  must  precede  effects.  Now  consider  the 
evidence  regarding  the  necessity  of  Q^.  Several  researchers  have  demon¬ 
strated  that  despite  the  presence  of  strong  cues-to-causality,  people  will 
still  fail  to  make  causal  attributions  if  they  are  unable  to  perceive  a  causal 
link  (i.e.,  ■  0).  For  example,  whereas  Michotte  (1946)  shored  that 

subjects  perceived  causal  relations  when  the  movement  of  objects  after  contact 
was  congruent  with  prior  trajectories  and/or  positions,  this  was  not  the  case 
when  one  object  touched  the  other  and  the  latter  changed  color,  got  larger,  or 
rose.  Consider  further  the  literature  demonstrating  the  limits  of  classical 
conditioning.  For  instance,  although  Watson  and  his  colleagues  were  able  to 
condition  little  Albert  to  fear  rabbits  by  pairing  the  appearance  of  a  rabbit 
with  that  of  a  large  noise,  they  could  not  produce  the  same  effect  when  the 
rabbit  was  replaced  by  a  block  of  wood  or  a  cloth  curtain  (Nisbett  £  Ross, 
1980,  p.  104).  Similarly,  Garcia  and  his  colleagues  (Garcia  et  al.,  1968; 
Garcia  et  al. ,  1972;  Garcia,  1981)  have  demonstrated  that  rats  can  learn  to 


Qb  ■  degree  to  which  X  is  a  dif ference-in-the  background  (0  <  QB  <  1 ) 


Ql  »  covariation  of  X  with  Y 


(0  <  Q1  <  1 ) 


Q2  *  temporal  order  of  X  and  Y  (0,  1 ) 


Q3  *  contiguity  in  time  and  space 
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similarity  of  cause  and  effect 


<congruity 

physical  similarity  (Q 


-  causal  chain  strength  (0  <  gL  <  i ) 


0  if  no  construction 
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j5i  cj  otherwise;  *  c 


where,  *  *  number  of  links  in  the  causal  chain 
*  f(Q3  ,  congruity) 

0^  »  covariation  at  jth  link 
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associate,  after  one  trial,  distinctive  tasting  food  and  a  gastro-intestinal 
illness  (induced  by  X-rays)  several  hours  later,  i.e.,  despite  a  large 
temporal  gap.  However,  the  rats  do  not  learn  to  associate  a  differential 
shape  of  food  to  the  illness,  when  the  taste  is  familiar.  In  a  provocative 
review  of  this  and  other  literature  on  learning,  Seligman  (1970)  suggested 
that  organisms  are  differentially  prepared  to  learn  different  types  of 
relations.  Specifically,  Seligman  argued  that  stimuli  can  be  arranged  along 
continuum  of  preparedness,  from  "contra-prepared”  through  "prepared, "  that 
indicates  the  ease  with  which  organisms  can  learn  to  perceive  contingencies. 
In  essence,  «  0  for  contra-prepared  relations.  However,  the  extent  to 
which  the  source  of  this  phenomenon  is  biological  and  could  be  overcome  by 
relevant  environmental  contingencies  is  controversial. 

If  Qg,  q2,  and  are  all  necessary  for  the  perception  of  a  causal 

relation  ,  they  should  combine  multiplicatively  in  affecting  gross  strength, 
since  if  any  variable  is  0,  so  is  gross  strength.  On  the  other  hand,  since 
the  other  variables  in  Table  3  are  not  necessary,  their  effect  on  gross 
strength  should  be  compensatory .  indeed,  our  review  of  the  literature  (see 
below),  suggests  the  following  functional  form  for  relating  gross  strength  to 
the  cues, 

a(X,Y)  -  Q,QOUrQT  +  \0,  +  A  O')  (7) 
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where  the  A's  are  attentional  weights  (X,  +  A  +  A  ■  1).  Note  that  while 

L  1  4 

contiguity  and  congruity  are  not  explicitly  shown  in  the  equation,  they  are 

included  via  their  effects  on  Q^.  That  is,  since  contiguity  and  congruity 

—  I 

are  a  decreasing  function  of  1,  and,  since  3  c  these  cues  do  affect 
gross  strength.  We  now  consider  further  evidence  supporting  the  form  of 
equation  (7).1 


(b)  Several  studies  have  examined  how  the  cues  trade-off.  In  develop¬ 
mental  psychology,  for  example,  Sedlak  and  Kurtz  (1981)  have  reviewed  research 
on  three  cues:  temporal  order,  covariation,  and  contiguity.  In  summarizing 
the  results  of  studies  where  these  cues  conflict,  they  stated: 

When  a  temporal  delay  is  introduced,  only  children  8  years  of 
age  or  older  offer  reliable  evidence  of  covariations.  Specif¬ 
ically,  the  covariation  use  of  younger  children  deteriorates 
when  the  covarying  cause  is  temporally  non-contiguous  with  the 
effect  and  they  encounter  problems  in  ( 1 )  rationalizing  the 
delay,  or  (2)  recognizing  the  constancy  of  the  dexay.  Where 
these  tasks  are  made  difficult,  a  temporally  contiguous  (albeit 
inconsistent)  cause  is  preferred.  Thus  we  can  tentatively  rank 
order  the  three  Humean  principles.  It  appears  that  temporal 
antecedence  supercedes  spatial  contiguity  (Bullock  £  Gelman, 

1979),  and  that  temporal  (and  spatial)  contiguity  may  sometimes 
outweight  the  covariation  factor  in  children's  judgments 
(Mendelson  &  Shultz,  1976;  Siegler,  1975;  Siegler  &  Liebert, 

1974).  (Sedlak  fi  Kurtz,  1981,  p.  763.) 

Furthermore,  in  studies  that  have  contrasted  covariation  with  similarity, 
Shultz  and  Ravinsky  (1977)  found  that  6-year  olds  were  unwilling  to  label 
dissimilar  factors  as  causes,  even  in  the  presence  of  systematic  covari¬ 
ation.  On  the  other  hand,  older  children  (10-12  years  old)  favored  covari¬ 
ation  over  similarity.  They  also  found  that  the  relative  weights  given  to 
similarity  vs.  temporal  contiguity  varied  according  to  age;  similarity  out¬ 
weighed  contiguity  for  6-year  olds,  but  older  children  favored  contiguity  over 
similarity.  In  studies  involving  adult  subjects,  Einhorn  and  Hogarth  (1982a) 
varied  covariation,  contiguity,  and  similarity  in  a  factorial  design.  In  one 
experiment,  covariation  traded-off  with  similarity  but  there  was  no  effect  for 
contiguity.  However,  when  contiguity  was  made  more  salient  in  the  experi¬ 
mental  stimuli,  it  did  trade-off  with  covariation  and  similarity  thereby 
indicating  the  effects  of  attention  as  captured  by  the  X's  in  equation  (7). 
Finally,  it  is  interesting  to  note  that  conflicts  between  similarity 
(congruity)  and  contiguity  can  sometimes  be  resolved  by  distinguishing 
between  "precipitating"  and  "underlying"  causes.  The  former  is  generally 


som  action  or  event  that  is  high  in  temporal  and  spatial  contiguity  but  low 
in  similarity  of  length  or  strength  with  the  event.  The  latter  is  generally 
based  on  high  similarity  of  length  and  strength,  with  contiguity  being  less 
important.  For  example,  the  precipitating  cause  of  World  War  I  was  an 
assassination  in  Sarajevo,  but  the  underlying  cause (s)  were  economic  upheaval, 
German  nationalism,  and  so  on. 

An  important  aspect  of  equation  (7)  is  that  it  includes  both  the 
necessity  of  certain  cues  and  trade-offs  amongst  others.  This  feature  of 
our  model  helps  to  integrate  the  views  of  those  who  have  argued  both  for  and 
against  the  priority  of  cues-to-causality.  On  the  one  hand,  researchers  such 
as  Shultz  (1982)  have  presented  convincing  evidence  that  children  will  ignore 
certain  cues-to-causality  when  these  conflict  with  a  perceived  link  between 
cause  and  effect  (Q^  >  0) .  Shultz  interpreted  his  results  as  indicating  that 
the  rules  governing  attributions  of  causality  give  precedence  to  notions  of 
"generative  transmission"  over  the  (Humean)  cues-to-causality.  On  the  other 
hand,  we  believe  that  posing  these  issues  as  a  controversy  between  the  Humean 
or  "regularity"  approach  to  causality  and  the  "generative"  theory  is 
misplaced.  First,  we  stress  the  notion  that  the  cues-to-causality  are  not 
"rules;"  they  are  imperfect,  probabilistic  indicators  (Brunswik,  1952; 
Campbell,  1966).  Ttius,  whereas  the  presence  of  the  cues  can  strengthen 
judgments  of  probable  cause,  they  do  not  establish  it  as  such.  Indeed,  the 
discovery  of  new  causal  mechanisms  often  involves  thinking  in  ways  that 
counteract  some  of  the  cues.  (This  is  discussed  further  below.)  Second, 
since  the  cues  play  a  crucial  role  in  delimiting  the  particular  models  of 
"generative  transmission"  that  people  use  in  causal  reasoning,  it  cannot  be 
said  that  the  cues  per  se  are  disregarded  in  favor  of  the  latter,  ttiird, 
while  cur  approach  puts  considerable  emphasis  on  generative  transmission  via 
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Q^,  it  also  allows  for  trade-offs  if  is  above  a  threshold  level,  There¬ 

fore,  if  the  Humean  principles  are  given  a  probabilistic  interpretation  and 
gross  strength  is  comprised  of  both  compensatory  and  noncompensatory  aspects, 
we  see  considerable  complementarity  between  the  two  approaches. 

While  "generative*  links  are  represented  in  equation  (7)  by  Q^,  can 

also  be  taken  to  represent  the  strength  of  any  theory  (whether  acquired  via 
experience  or  "innate")  that  people  bring  to  bear  in  causal  reasoning.  Thus, 
can  be  likened  to  a  schema  that  the  person  brings  to  bear  in  a  particular 
situation  (Schank  a  Abe Ison,  1977;  Abelson,  1981).  Such  schemas  are  usually 
considered  to  be  based  on  past  experience  and  world  knowledge  and  have  been 
studied  from  a  variety  of  viewpoints.  (See  also  the  growing  literature  on 
"story  grammars,"  Stein  fi  Glenn,  1979;  Trabasso  et  al. ,  1984;  Trabasso  £ 
Sperry,  1984.)  As  the  reader  will  note,  we  have  allowed  to  enter 

equation  (7)  twice;  once  as  a  necessary  condition  (outside  the  parentheses), 
and  once  in  a  fora  that  allows  trade-offs  with  both  covariation  and  physical 
similarity,  this  fora  of  equation  was  chosen  to  reflect  the  considerable 
literature  summarized  by  Nisbett  and  Ross  (1980)  under  the  heading  of  "theory 
driven  vs.  data  driven"  judgments  of  covariation.  Zn  short,  Nisbett  and  Ross 
show  that  there  is  much  evidence  to  suggest  that  people  will  often  allow  their 
prior  conceptions  of  relations  (i.e.,  Q^)  to  outweigh  considerations  of  more 

data-driven  estimates  of  covariation  (i.e.,  Q, ).  Thus,  even  if  X1Q1  is 
zero,  strong  causal  relations  can  still  be  perceived.  In  analogous  fashion, 
Shweder  (1977)  has  summarized  much  literature  that  goes  under  the  heading  of 
"illusory  correlation."  equation  (7)  allows  for  this  phenomenon  since 
judgments  of  gross  strength  can  be  high  if  the  lack  of  covariation  is 
compensated  for  by  prior  theory  (i.e.,  Q^)  and/or  physical  similarity  between 


the  variables  (i.e.,  Q'^) 


The  tension  between  "theory”  and  "data"  (i.e. ,  and  Q1 )  that  is 

captured  in  (7)  is  also  helpful  for  understanding  a  perennial  methodological 
issue:  spurious  correlation.  The  existence  of  this  concept  suggests  that 
soma  correlations  are  more  {or  less)  causally  related  than  others,  and  thereby 
raises  the  issue  of  how  to  tell  the  difference  (cf.  Simon,  1954).  For 
example,  consider  the  correlation  between  the  number  of  pigs  and  the  amount  of 
pig-iron  (Ehrenberg,  1975).  Such  a  correlation  seems  spurious  when  the  common 
causal  factor,  "economic  activity,"  is  considered.  On  the  other  hand, 
consider  the  correlation  between  amount  of  rain  and  number  of  auto  traffic 
accidents  in  a  city,  over  the  course  of  a  year.  Such  a  correlation  does  not 
seem  spurious  (or  at  least,  less  spurious).  What  is  the  difference  between 
these  two  cases? 

If  one  makes  use  of  equation  (7),  the  spuriousness  of  the  correlation 
between  pigs  and  pig-iron  becomes  apparent.  That  is,  although  the  covariation 
cue  points  to  a  causal  relation,  temporal  order  cannot  be  used  to  specify 
■***•* **))  is  cause  or  effect  (Qj  “  0);  and,  most  people  cannot  construct 

a  causal  chain  II . .  |  1 1|  I  H  n|o-<™n  ■  o).  Indeed,  the  judgment  that 

the  relation  is  spurious  is  made  easily.  II . . 1  IWl  Shir  pg - J  case: 

assuming  that  there  is  a  statistical  relation  (Q,  >  0),  note  how  the  other  ~ 
cues  reinforce  that  link.  The  temporal  order  of  rain  and  accidents  is  clear; 
it  is  relatively  easy  to  construct  a  chain  linking  the  controllability  of  cars 
on  slippery  roads  (as  long  as  there  is  high  contiguity  in  time  and  space 
between  rain  and  accidents);  and,  rain  is  not  a  standing  state  or  condition. 
There  seems  less  doubt  that  the  correlation  is  "real." 

When  cues-to-causality  conflict,  spurious  correlation  is  not  the  only 
outcome;  e.g.,  a  low  or  zero  statistical  correlation  could  mask  a  true  -  >sal 
relation  (Einhorn  &  Hogarth,  1982b).  To  illustrate,  Imagine  that  we  were 


ignorant  aa  to  tha  cauaa  of  birth.  However,  it  has  been  suggested  that  sexual 
intercourse  is  related  to  pregnancy  and  the  following  experiment  was  designed 
to  test  this  hypothesis!  100  couples  were  allocated  at  random  to  an  inter¬ 
course  condition,  and  100  to  a  non-intercourse  condition.  As  indicated  in 
Table  4,  25  females  became  pregnant,  and  175  did  not.  In  light  of  our  current 

|nsert~Table~4 ~about_here 

knowledge  (but  unknown  to  our  hypothetical  selves),  we  can  state  that  the  5 
people  in  the  no-intercourse/yes-pregnancy  cell  represent  "measurement  error," 
i.e.,  faulty  memory  in  reporting,  lying,  etc.  Since  the  statistical  cor¬ 
relation  is  small  (r  -  0.34),  we  might  question  whether  the  hypothesis  is 
worth  pursuing.  Indeed,  if  the  sample  size  were  smaller,  the  correlation 
might  not  even  be  "significant."  Moreover,  even  with  a  significant 
correlation,  r2  -  0.12,  which  is  hardly  a  compelling  percentage  of  the  Y 
variance  accounted  for  by  X. 

There  are  two  important  implications  of  this  example.  First,  whereas 
statistics  texts  correctly  remind  us  that  correlation  does  not  necessarily 
imply  causation,  the  imperfect  nature  of  this  cue-to-causality  is  also 
reflected  in  the  statement:  causation  does  not  necessarily  imply  correlation. 
we  have  somewhat  facetiously  labeled  examples  of  the  latter  as  "causalations,  ” 
-gxvi ng^them  equa 1  standing  with  the  batter-known  and  opposite  concept  of 
spurious  correlation^'  'Second,  causalation  demonstrates  that  sole  reliance  on 
a  single  cue,  such  as  covariation,  is  inadequate  for  understanding  causal 
relations.  Indeed,  as  equation  (7)  shows,  gross  strength  can  be  non-zeroaven 
if  Q1  is  low.  Therefore,  the  use  of  multiple  cues  highlights  the  role  of 
judgment  in  interpretating  data  (see  also  Simon,  1954)  and  equation  (7) 
provides  a  basis  for  understanding  how  this  is  done. 

The  examples  cited  so  far  have  all  Involved  physical  causation.  However, 


our  framework  can  also  be  applied  to  judgments  of  probable  cause  in  the  social 
domain.  Consider,  for  example,  judgments  of  probable  cause  where  actions  are 
explained  by  motivations,  intentions,  or  goals.  Zn  these  cases,  once  the 
attributor  postulates  a  motive,  note  that  the  motive  has  the  appropriate 
temporal  order  (since  the  existence  of  the  motive  is  assumed  to  precede  the 
act),  high  temporal  and  spatial  contiguity  are  assured,  and  the  judgment  often 
requires  constructing  only  a  simple  link  model  to  go  from  the  existence  of  the 
motive  to  the  action.  Our  analysis  therefore  suggests  that  in  distinguishing 
between  causal  candidates,  judgments  of  probable  cause  in  the  social  domain 
will  depend  primarily  on  two  cues,  covariation  and  the  salience  of  x  as  repre¬ 
sented  by  the  degree  to  which  it  is  a  difference-in-the-background  (i.e., 

Qg).  Indeed,  it  is  instructive  to  note  that  both  covariation  and  salience 
have  been  the  subject  of  intense  investigation  in  the  literature  on  social 
attribution  (Hastie,  1983;  Harvey  aweary,  1984).  Kelley's  (1967,  1973) 
influential  ANOVA  model,  for  example,  was  constructed  solely  on  the  basis 
of  covariation  patterns. 

Role  of  specific  alternatives 

The  importance  of  specific  alternative  explanations  in  assessing  the 
strength  of  causal  candidates  has  been  stressed  by  many.  Of  particular 
importance  is  the  work  of  Campbell  and  colleagues  on  threats  to  "internal 
validity”  (Campbell  fi  Stanley,  1963;  Campbell,  1969;  Cook  £  Campbell,  1979). 
Indeed,  the  assessment  of  internal  validity,  whereby  one  asks  what  factors 
other  than  X  could  have  produced  Y,  is  a  basic  component  of  causal  inference. 
Similarly,  attribution  theorists  (e.g.,  Kelley,  1973;  Jones,  1979)  have 
discussed  the  discounting  of  a  causal  explanation  on  the  basis  of  plausible 
alternatives.  From  our  perspective,  specific  alternatives  are  important 
beyond  their  obvious  role  in  lowering  the  covariation  between  X  and  Y  (by 


increasing  the  frequency  of  the  C  cell).  The  reason  is  that  specific  alter¬ 
natives  allow  one  to  use  the  cues-to-causality  to  assess  their  plausibility 
(i.e. ,  determine  the  gross  strength  of  alternatives).  Hence,  the  content 
knowledge  associated  with  a  specific  alternative  affects  its  gross  strength 
(via  equation  (7)),  which  in  turn  is  used  to  discount  the  original  explanation. 

The  notion  of  a  discounting  process  provides  an  important  connection 
between  research  on  causal  reasoning  and  probabilistic  inference  since  much  of 
the  latter  concerns  how  people  update  their  present  beliefs  on  the  basis  of 
new  information.  Therefore,  the  discounting  of  a  causal  explanation  by 
specific  alternatives  involves  a  process  of  updating  one's  (causal)  beliefs  on 
the  basis  of  new,  and  in  this  case,  negative  evidence  (i.e.,  alternative 
explanations) .  Moreover,  the  idea  that  judgments  of  probable  cause  result 
from  an  updating  process  focuses  attention  on  the  dynamic  processes  underlying 
judgment  and  choice  (Hogarth,  1 981 ) .  Indeed,  we  now  discuss  several  issues 
that  follow  from  considering  judgments  of  probable  cause  as  resulting  from  an 
updating  process. 

To  begin,  recall  equation  (1),  in  which  the  net  strength  of  a  causal 
explanation  after  k  specific  alternatives,  SjjfXjY),  increases  with  the 
gross  strength  of  X,  sQ(x,Y),  and  decreases  with  the  gross  strengths  of 
alternatives  Z^,  stZ^,  Y).  To  illustrate  the  substantive  issues  involved  in 
specifying  the  discounting  process,  consider  the  following  model: 

SX(X,Y)  -  Sjt_1  (X, Y)  -  w^  »(ZJc»X)  (8) 

where,  w^  is  a  weight  that  reflects  the  importance  given  to  the  Z^  alter¬ 
native  (to  be  discussed  below).  Equation  (8)  can  be  interpreted  in  a  variety 
of  ways;  e.g.,  a  sequential  anchoring-and-adjustment  model,  a  " Bayes ian-like" 
update  model,  etc.  From  the  present  perspective,  equation  (8)  is  useful 


because  it  raises  three  general  issues:  (a)  Why  is  the  model  subtractive  in 
fora,  rather  than  a  ratio?;  (b)  Bov  is  the  plausibility  of  an  alternative 
affected  by  other  alternatives,  as  veil  as  by  its  similarity  to  X?;  (c)  What 
factors  affect  the  importance  veight,  v? 

(a)  Consider  a  situation  in  vhich  there  are  no  specific  alternatives  to 
explanation  X.  A  ratio  model  of  the  form, 


Sfc(X,Y)  - 


Sfc-I1*-*1  -«>  »<v*> 


<S> 


would  alvays  yield  Sk(X,Y)  -  1,  vhich  is  clearly  unacceptable.  Indeed,  an 
important  implication  of  the  subtractive  form  is  that  the  net  strength  of  an 
explanation  can  be  low  even  though  specific  alternatives  do  not  exist  (vhen 
gross  strength  is  lov).  On  the  other  hand,  if  gross  strength  is  high  and  few 
specific  alternatives  exist,  the  explanation  will  be  seen  as  a  highly  probable 
cause  even  if  the  inference  is  based  on  a  one-shot  case  study  (see  Campbell, 
1975,  for  an  illuminating  discussion  of  this  issue).  As  a  case  in  point, 
consider  the  occurrence  of  a  huge  explosion  near  Los  Alamos,  New  Mexico,  in 
July  1945.  Mo  one  doubted  this  to  be  the  effect  of  detonating  an  atomic 
bomb.  Clearly,  inferring  causality  in  this  poorly  designed  experiment  was 
not  difficult  whereas  assessing  causality  in  the  most  meticulously  designed 
experiments  in  social  science  is  often  problematic  at  best.  When  one 
considers  why  the  causal  inference  is  so  strong  in  the  bomb  example,  first 
note  that  there  is  a  good  link  for  connecting  the  events  (atomic  physics)  and 
the  cues-to-causality  strongly  point  to  a  causal  relation.  Second,  ask 
yourself  the  following  question:  "Would  an  explosion  of  such  magnitude  have 
occurred  if  an  atomic  bomb  had  not  gone  off?"  While  it  is  possible  to  think 
of  alternative  explanations  for  the  explosion,  they  are  so  unlikely  as  to  be 


virtually  non-existent.  Therefore,  a  good  explanation  has  high  gross  strength 
and  few  plausible  alternatives. 

(b)  A  aajor  limitation  of  the  model  represented  by  equation  (8)  concerns 
the  similarity  or  redundancy  of  alternatives  to  the  original  explanation  and 
to  each  other.  For  example,  alternative  explanations  that  are  either  more 
specific  or  more  general  instances  of  X  should  not  discount  X.  Similarly, 
imagine  that  one  has  many  alternative  explanations  that  are  highly  redundant. 
To  treat  each  alternative  as  if  it  were  an  independent  explanation  would 
clearly  be  incorrect.  Indeed,  an  analogy  to  multiple  regression  with 
correlated  variables,  or  conditional  dependence  in  Bayesian  statistics,  is 
useful  here.  That  is,  the  strength  of  a  variable  is  seen  as  its  incremental 
contribution  to  explaining  Y  when  partialling  out  (or  otherwise  holding 
constant) ,  other  correlated  factors.  In  the  same  way,  the  plausibility  of  an 
alternative  can  depend  on  other  alternatives  as  well  as  its  similarity  to  X. 
Put  more  formally,  the  plausibility  of  alternative  Z^  needs  to  be  expressed 
conditionally  on  X  and  the  other  alternatives;  i.e.,  S(Zjc,y|x,Z1,Z2,  ...,  Zk_ 
The  descriptive  questions  concerning  how,  and  to  what  extent,  people  handle 
the  redundancy  between  specific  alternatives  have  barely  been  asked. 

(c)  The  importance  weight  in  equation  (8)  can  reflect  two  classes  of 
factors:  those  that  characterize  individual  differences  in  the  way  alterna¬ 
tives  are  utilized;  and,  process  variables  regarding  how  the  discount 
mechanism  incorporates  attention,  surprise,  serial  position  effects,  and  the 
like.  Consider  the  former  and  imagine  that  one  has  a  very  strong  belief  that 
X  causes  Y  such  that  specific  alternatives  are  weighted  zero  (w^  »  0,  for 
all  k).  Such  a  person's  beliefs  would  be  impervious  to  negative  evidence  in 
the  form  of  new  alternatives.  On  the  other  hand,  another  person  may  be  more 
receptive  to  alternatives  and  weight  them  heavily,  thereby  discounting  the 
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original  belief  substantially.  While  the  second  person  would  be  closer  to  the 
Popperian  ideal  than  the  first  (seeking  to  disconfirm  rather  than  confirm 
one's  hypotheses),  it  is  to  be  expected  that  people  will  vary  considerably  in 
the  degree  to  which  they  engage  in  discounting. 

the  second  class  of  factors  can  be  incorporated  via  the  importance  weight 
in  a  variety  of  ways.  For  example,  Einhorn  and  Hogarth  (1983)  included  a 
contrast  or  surprise  component  in  the  discount  process  by  making  the 
importance  weight  a  function  of  net  strength  at  the  k-1  level;  i.e., 

Sk(X,Y)  -  Sk1(X,Y)  -  w^_i  s<Zk,Y)  (10) 

The  model  assumes  that  as  net  strength  decreases  after  each  specific  alterna¬ 
tive,  the  importance  weight  also  decreases.  This  implies  that  an  alternative 
of  given  strength  will  discount  a  strong  explanation  more  than  a  weak  one 
(i.e.,  the  bigger  they  are,  the  more  they  fall).  On  the  other  hand,  a  weak 
causal  candidate  cannot  be  discounted  much  since  it  is  already  low;  indeed,  if 
X  were  worthless,  there  would  be  no  discounting.  The  model  was  tested  in 
several  experimental  situations  with  one  and  two  specific  alternatives  and 
provided  a  good  fit  to  the  data.  From  the  present  perspective,  such  models 
open  the  way  for  incorporating  various  cognitive  processes  (such  as  attention 
and  memory)  in  the  sequential  updating  of  causal  beliefs.  Indeed,  the 
importance  of  viewing  discounting  within  an  updating  framework  lies  in  linking 
basic  cognitive  processes  to  causal  inference  in  general  and  judgments  of 
probable  cause  in  particular. 

Multiple  causation,  covariation,  and  conjunction  effects 

The  framework  presented  thus  far  represents  what  might  be  called  the 
"univariate"  case;  that  is,  a  single  causal  candidate  X,  and  single  effect 
Y.  The  difficulties  of  delineating  the  "multivariate"  case  are  many,*  however, 


sine*  multiple  causation  is  crucial  to  many  real-world  inferences,  we  focus  on 
one  aspect  that  is  particularly  germane  to  the  conflict  between  causal  and 
probabilistic  thinking.  Specifically,  Tversky  and  Kahnenan  (1983)  have  shown 
that  people  often  judge  the  probability  of  a  conjunctive  event  as  being 
greater  than  the  probability  of  one  of  its  constituents.  Froa  a  statistical 
viewpoint,  this  is  an  error  since  a  conjunction  cannot  be  more  probable  than 
one  of  its  components.  Accordingly,  Tversky  and  Kahnenan  label  this  response 
a  "conjunction  fallacy"  giving  numerous  reasons  for  its  occurrence.  Two 
recent  papers,  however,  raise  questions  about  the  generality  of  conjunction 
effects  in  causal  explanations. 

In  the  first  paper  (Leddo,  Abelson,  &  Paget,  in  press) ,  subjects  were 
presented  with  vignettes  about  individuals  engaged  in  various  activities  and 
were  asked  to  judge  the  probability  of  both  single  and  conjunctive  reasons  for 
why  the  behavior  occurred.  While  conjunctive  effects  were  generally  found 
(i.e.,  explanations  with  more  reasons  were  judged  as  being  more  probable  than 
those  with  fewer  reasons),  they  were  not  found  when  the  story  involved  an 
actor  not  doing  something  (e.g.,  why  didn't  Fred  stop  at  the  Italian 
restaurant  while  he  was  driving  down  the  highway?).  Leddo  et  al.  argue  that 
in  some  situations,  the  judged  probability  of  an  explanation  may  increase  with 
the  number  of  reasons  up  to  a  point  after  which  it  declines.  This  notion  of 
"single-peakedness"  (Coombs  S  Avrunin,  1977)  over  the  number  of  reasons  in  an 
explanation  is  an  intriguing  idea  that  we  consider  below.  In  the  second 
paper,  Locksley  and  Stangor  (in  press)  presented  subjects  with  either  a  rare 
event  (suicide)  or  common  event  (getting  married)  and  asked  them  to  rank  10 
explanations  in  terms  of  the  probability  that  the  event  was  caused  by  that 
explanation.  The  10  explanations  consisted  of  both  single  reasons  and 
conjunctions.  The  results  showed  substantial  conjunction  effects  when  the 


outcome  was  rare,  but  considerably  fewer  effects  when  the  outcome  was  common. 

To  discuss  conjunction  effects  within  our  framework,  we  assume  that  In 

the  absence  of  other  cues  (or  holding  them  constant),  judgments  of  probable 

cause  will  be  based  on  the  covariation  cue.  We  further  assume  that  all  cells 

in  the  covariation  table  receive  a  non-zero  weight  (i.e. ,  B^  >  0)  such 

that  Q1  is  at  least  monotonic  with  statistical  correlation.  Mow  consider 

what  happens  when  two  factors,  each  positively  related  to  Y,  are  conjoined 

into  a  single  explanation  (denoted  as  (X^DX^).  The  scenarios  that  comprise 

the  covariation  of  (x  HX_)  with  Y  are  shown  in  Table  5.  As  was  the  case 

1  2 

Insert  Table  5  about  here 

with  single  factors,  a  represents  the  minimally  sufficient  scenarios  that 
contain  the  various  combinations  of  Y  and  Y  with  (X^flX^  and  (X^HX^. 
There  are  several  things  to  note  about  Table  5  in  comparison  with  Table  1 : 

(1 )  The  number  of  scenarios  in  which  (X^flX^  conjoins  with  other  conditions 
to  produce  Y  (cell  A),  must  be  less  than  or  equal  to  the  lesser  of  ayp|X 
and  •  This  must  hold  because  a^x  is  a  subset  of  the  scenarios  in 

which  X1  and  X2  each  combine  with  Y.  However,  by  adding  more  valid 
causes  to  the  scenario,  one  also  decreases  the  number  of  scenarios  in  which 
Y  occurs  with  (X^Dx^)  (cell  B).  Indeed,  the  conjunction  (XjHx^)  Is  more 
fully  sufficient  for  Y  since  the  uncertainties  associated  with  the 
conditionality  of  causation  are  reduced.  Therefore,  although  cell  A  can 
decrease  with  conjunctive  explanations,  the  B  cell  must  show  a  larger  decrease 
for  conjunctive  effects  to  occur;  (2)  The  base  rate  (or  marginal  probability) 
of  the  event  Y  is  a  constant  for  any  given  situation.  Therefore,  if  A  and  B 
decrease  due  to  a  conjunctive  explanation,  C  and  D  must  increase.  This  means 
that  adding  more  reasons  to  an  explanation  increases  its  sufficiency  for  Y 
at  the  "cost"  of  decreasing  its  necessity.  Hence,  the  nature  of  the  conflict 


that  is  a volt  ad  whan  reasons  are  added  to  an  explanation  results  from  the  net 
effect  on  covariation  of  decreases  in  cell  B  vs.  increases  in  cell  C.  More¬ 
over,  since  covariation  judgments  depend  on  the  relative  weight  given  to  cells 
B  vs.  C  (via  causal  questions)  as  well  as  the  differential  rates  of  change  in 
B  and  C  (which  depends  on  such  factors  as  the  base  rate  of  Y,  the  base  rates 
and  intercorrelations  between  the  various  X's,  etc.),  a  full  account  of 
conjunction  effects  would  require  a  complex  analysis  in  its  own  right. 

However,  from  our  perspective,  the  basic  point  is  that  the  conflict  between 
sufficiency  and  necessity  can  produce  judgments  of  probable  cause  that  are 
either  monotonic  br  single-peaked  with  respect  to  the  number  of  reasons  in  an 
explanation. 

How  does  the  above  framework  account  for  the  Leddo  et  al.  (in  press)  and 
Locksley  and  Stangor  (in  press)  results?  In  the  first  study,  conjunction 
effects  were  not  found  when  an  actor  didn't  do  something  (Fred  didn't  stop  at 
the  Italian  restaurant).  We  would  argue  that  the  base  rate  of  not  stopping  at 
the  restaurant  is  very  high  and  the  various  reasons  (wasn't  hungry,  had  no 
money,  dislikes  Italian  food)  are  each  sufficient  (or  close  to  sufficient)  for 
not  stopping.  When  one  conjoins  these  reasons  (he  didn't  stop  because  he 
wasn't  hungry  and  he  had  no  money  and  he  doesn't  like  Italian  food),  one 
creates  a  redundant  sufficient  condition;  i.e.,  a  scenario  that  is  still 
sufficient  if  a  conjunct  is  dropped.  Such  a  scenario  is  less  necessary  for 
Y  since  Y's  occurrence  Implies  that  any  single  conjunct  or  pair  of 
conjuncts  could  have  happened.  In  order  to  see  how  this  reduces  overall 
covariation,  consider  the  hypothetical  data  in  Table  6. 

insert”TabIe~6~about“herS 

In  Table  6a,  both  X1  and  X2  are  each  nearly  sufficient  for  Y  (the 
two  correlations  with  Y  are  .14).  In  Table  6b,  the  conjunction  (X^  flXj) 


Hypothetical  Exanples  of  Covariation  Patterns 
with  Conjunction  Effects 


is  sufficient  for  Y,  but  the  large  increase  in  the  C  cell  reduces  the  cor¬ 
relation  to  .11.  Zn  Table  6c,  (x^nx^nx^)  makes  up  a  redundant  sufficient 
condition  since  (X^DX2)  is  already  sufficient.  Again,  the  C  cell  increases 
(and  is  not  off-set  by  a  decrease  in  the  B  cell) ,  and  the  correlation  goes 
down  (r  »  .08).  Clearly,  in  this  situation,  increasing  the  number  of  reasons 
for  an  event  leads  to  a  monotonic  decline  in  judged  probability.  Of  course, 
we  could  have  changed  the  numbers  in  Table  6b  so  that  the  explanation  with  two 
reasons  was  more  likely  than  either  the  one  or  three-reason  explanation 
(yielding  a  single-peaked  function).  However,  the  point  of  the  example  is  not 
to  argue  for  a  particular  function  in  all  cases;  it  is  to  show  how  changes  in 
the  various  cells,  when  constrained  by  a  particular  base  rate,  limit  the 
nature  and  size  of  conjunction  effects.  Indeed,  the  example  illustrates  that 
conjunction  effects  are  less  likely  to  occur  when  the  event  of  interest  has  a 
high  base  rate.  This  is  consistent  with  our  analysis  since  the  B  cell  cannot  be 
reduced  much  (since  it  is  already  small),  relative  to  the  potential  increase  in 
the  C  cell.  Thus  the  empirical  findings  of  few  conjunction  effects  when  base 
rates  are  high  concur  with  the  implications  of  our  framework. 

Now  consider  rare  events  that  one  believes  to  result  from  the  conjunction 
of  many  reasons.  Note  that  by  definition,  a  rare  event  must  have  small  A 
and/or  C  cells  and  conversely,  large  B  and/or  0  cells.  When  multiple  reasons 
are  added  to  an  explanation,  the  effect  is  likely  to  lower  a  large  B  cell  sub¬ 
stantially  while  the  C  cell  cannot  be  increased  very  much.  Therefore, 
increasing  the  reasons  for  a  rare  event  will  likely  lead  to  an  increase  in 
covariation,  judged  probability,  and  conjunction  effects.  Again,  this  is 
consistent  with  empirical  findings  (Locksley  &  Stangor,  in  press).  However, 
we  would  expect  a  limit  to  the  number  of  reasons  one  can  add  to  an  explanation 
beyond  which  it  is  judged  less  probable.  As  far  as  we  know,  this  conjecture 
has  not  yet  been  put  to  empirical  test. 
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DISCUSSION 

Our  framework  for  studying  probable-caur '-judgments  has  three  major 
components:  (1)  the  causal  background;  (2)  the  cues-to-causality;  and 
(3)  discounting  via  specific  alternatives.  Our  discussion  therefore  centers 
on  the  research  questions  and  opportunities  that  are  raised  by  considering 
these  aspects  of  causal  judgments. 

the  causal  background.  In  our  view,  one  of  the  most  challenging  problems 
in  causal  inference  is  to  develop  a  theoretical  model  to  predict  the  particu¬ 
lar  causal  background  a  person  is  likely  to  adopt  in  a  given  situation.  That 
is,  whereas  our  framework  can  be  used  to  understand  the  relative  strength  of 
causal  candidates  conditional  on  a  given  background,  we  are  unable  to  make 
statements  about  how  people  come  to  focus  on  specific  backgrounds.  This 
predicament,  however,  is  common  to  other  areas  of  cognitive  psychology.  Two 
immediate  parallels  are  the  work  by  Simon  and  Hayes  on  "problem  isomorphs" 
(1976),  and  that  of  Tversky  and  Kahneman  on  "decision  frames"  (1981).  In 
common  with  the  effects  of  the  causal  background,  these  investigations 
demonstrate  how  different  representations  of  structurally  identical  stimuli 
can  have  important  impacts  on  responses  (for  a  more  general  review,  see 
Hogarth,  1982).  What  makes  these  problems  particularly  difficult  to  study  is 
that  the  meaning  people  give  to  stimuli  is  dependent  on  prior  knowledge  and 
experience.  For  example,  whereas  it  is  easy  to  say  that  people  are  responsive 
to  deviations,  such  deviations  are  often  defined  relative  to  ejqaectations; 
thus,  any  experimental  work  has  to  deal  operationally  with  controlling 
these.  Further  confounding  the  issue  is  the  fact  that  once  a  person  has 
adopted  a  particular  field,  the  cues-to-causality  are  likely  to  reinforce  the 
initial  viewpoint  and  thus  the  person's  understanding  of  the  situation. 

Although  the  determination  of  the  causal  background  remains  problematic, 
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cons Ida r  the  distinction  between  causes  and  conditions  made  earlier,  That  is. 
in  discussing  how  causal  variables  in  one  background  can  become  conditions  in 
another,  we  stated  that  causal  factors  are  generally  intrusive  or  abnormal 
events  rather  than  standing  states  or  normal  events.  However,  standing  states 
can  be  seen  as  causal  if  the  outcome  to  be  explained  is  a  standing  state.  For 
example,  what  is  the  cause  of  "poverty?”  Since  the  effect  to  be  explained  is 
a  standing  state  of  large  duration  and  magnitude,  the  congruity  cue  demands 
that  the  cause  be  of  comparable  duration  and  magnitude.  Indeed,  to  suggest 
a  simple  cause  is  to  be  accused  of  being  "simplistic,"  only  "root"  or 
"underlying"  causes  will  do.  Therefore,  standing  states  can  be  causal  if  the 
need  to  maintain  congruity  is  important.  However,  even  in  this  case,  note  the 
subtle  effect  of  the  causal  background  in  the  posing  of  the  question;  i.e., 
poverty  requires  an  explanation  because  it  is  seen  as  a  deviation  from  some 
desired  state.  On  the  other  hand,  if  poverty  were  seen  as  a  usual  state,  one 
would  be  asking,  what  causes  prosperity?  Therefore,  even  when  standing  states 
are  viewed  as  causal,  we  would  argue  that  there  is  an  implicit  deviation  in 
the  posing  of  the  question. 

Finally,  the  role  of  deviations  in  a  background  goes  beyond  its  impor¬ 
tance  in  causal  inference.  For  example,  when  expectations  that  rest  on  an 
assumed  background  are  violated,  surprise  can  be  an  important  cue  for 
reorganizing  or  restructuring  one's  hypotheses  and  explanations.  Imagine  a 
hit-and-run  accident  in  which  all  the  witnesses  said  that  the  offending  car 
was  going  73  miles  per  hour  at  the  moment  of  inpact.  Since  we  expect  much 
greater  variability  in  such  estimates,  as  well  as  round  numbers,  this 
surprising  unanimity  might  cue  one  to  ask  whether  the  witnesses  had  colluded 
in  their  responses.  Similarly,  the  structure  of  outcomes  can  suggest  new 
hypotheses  such  that  the  diagnosis  contradicts  the  surface  meaning  of  the 
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evidence.  Bius,  scientific  data  that  ace  too  perfect  can  suggest  fraud  (see, 
for  example,  Kanin,  1974,  on  Burt's  twin  data;  Bishop,  et  al,  1975,  on 
Mendel's  pea  experiment),  evidence  in  a  trial  that  is  too  consistent  and 
obvious  can  suggest  the  defendant  was  "f rawed, “  and  one  can  "proteeteth  too 
much”  in  a  variety  of  circumstances.  Such  examples  illustrate  that  violations 
of  expectations  can  trigger  a  re-structuring  of  alternatives.  Of  course, 
specifying  the  conditions  that  lead  to  re-structuring  as  opposed  to  other 
responses  remains  an  important  and  unanswered  question. 

The  cues-to— causality.  Zn  discussing  the  cues-to-causality  and  how  these 
combine  with  the  background,  we  organize  our  comments  around  three  topics: 

(a)  the  functions  the  cues  perform  for  the  organism  and  the  attendant  costs 
and  benefits;  (b)  unresolved,  and  possibly  unresolvable  issues  concerning  the 
cues;  and  (c)  opportunities  for  further  research. 

(a)  the  object  of  establishing  causal  relations  is  to  make  sense  of  the 
world  so  that  the  organism  can  achieve  its  goals.  From  this  perspective,  both 
the  adoption  of  particular  causal  backgrounds  and  the  use  of  the  cues-to- 
causality  are  remarkably  efficient  mechanisms  in  that  they  reduce  the  scope  of 
the  inferential  task  within  limited  information  processing  constraints.  That 
is,  the  field  delimits  the  focus  of  attention,  and  the  cues  constrain  the 
types  of  causal  relations  constructed  (cf.  Table  2).  Moreover,  given  the 
inherent  complexity  of  the  environment,  one  can  also  argue  that  there  are 
functional  advantages  to  both  the  fluctuating  nature  of  attention,  and  the 
probabilistic  character  of  the  cues.  Specifically,  fluctuations  in  attention 
permit  alternative  representations  of  the  same  or  similar  tasks  (see  above), 
and  probabilistic  cues  imply  that  one  need  not  always  construct  the  same 
scenarios  within  a  field.  However,  we  note  three  important  trade-offs  that 
are  implicit  in  these  Inferential  mechanisms:  (i)  "order-out-of -chaos”  vs. 
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creativity.  As  just  noted,  the  causal  field  and  the  cues-to-cauaality  play 
important  roles  in  limiting  the  number  of  interpretations  people  make  in 
inferential  tasks,  and  thus  in  creating  "order-out-of -chaos."  Furthermore, 
the  adaption  of  a  particular  background  and  the  use  of  the  cues  proceed 
quickly  and  are  often  marked  by  a  lack  of  awareness  that  a  delimiting  process 
has  taken  place.  The  benefits  to  be  gained  from  such  automatized  processes 
are  large.  However,  they  come  at  the  cost  of  reducing  the  probability  that 
people  can  achieve  more  creative  representations  of  inferential  tasks. 

Indeed,  Campbell  (1960)  has  stressed  the  importance  of  deliberately 
introducing  random  variation  to  stimulate  creative  efforts,  especially  in 
science.  Without  such  random  perturbations,  he  argues  that  the  forces  that 
maintain  a  person's  particular  conception  of  a  problem  are  too  strong. 
Moreover,  the  literature  on  creativity  has  many  examples  of  techniques  that 
are  aimed  precisely  at  making  people  aware  of  the  delimiting  assumptions  they 
bring  to  tasks  (e.g.,  Adams,  1976).  In  addition,  when  using  such  techniques, 
people  are  often  requested  to  refrain  from  counterfactual  reasoning  and  to 
make  specific  use  of  analogies  and  paradox  to  enjoin  previously  disconnected 
ideas.  In  short,  to  restructure  problems  in  creative  ways  frequently  requires 
attempts  to  counter  the  habitual  forces  of  causal  reasoning;  (ii)  acquiring 
superstitions  vs.  causal  knowledge.  Since  the  cues  are  probabilistic 
indicators  of  causal  relations,  it  follows  that  they  will  sometimes  indicate 
invalid  causal  relations.  Moreover,  these  relations  could  well  be  reinforced 
by  environments  in  which  they  cannot  be  adequately  tested  (cf.  Einhorn,  i960). 
Thus,  the  cues-to-causality  can  also  lead  to  the  acquisition  of  superstitious 
beliefs  (cf.  Skinner  1966);  (iii)  imagination  vs.  uncertainty.  By  constrain¬ 
ing  the  interpretation  of  information,  the  cuss  also  reduce  feelings  of 
uncertainty  in  inference.  However,  given  the  probabilistic  nature  of  the 


cues,  the  level  of  uncertainty  experienced  could  Increase  if  people  exercised 
imagination  in  constructing  alternative  causal  scenarios  that  were  either 
consistent  with  or  even  violated  the  cues.  Zn  this  sense,  therefore,  the  cues 
mediate  the  trade-off  people  make  between  exercising  imagination  and 
experiencing  feelings  of  uncertainty. 

(b)  He  have  assumed  throughout  the  paper  that  the  cues-to-causality  have 
imperfect  but  non-zero  ecological  validities,  i.e.,  each  cue  is  predictive  of 
a  true  causal  relation.  How  do  we  know  this?  Simply  put,  we  don't.  The 
reason  is  that  without  some  measure  of  "true"  causality,  no  determination  of 
accurate  causal  knowledge  is  possible.  However,  the  fact  that  the  cues  we 
have  considered  are  implicated  in  a  wide  variety  of  studies  with  both  human 
and  animal  subjects,  leads  us  to  believe  that  they  would  not  continue  to  be 
used  if  they  were  useless.  Therefore,  our  argument  is  a  functional  and 
practical  one;  viz.,  given  the  importance  of  learning  and  inferring  causal 
relations  for  survival,  we  do  not  believe  that  the  cues  on  which  this  depends 
are  totally  worthless.  On  the  other  hand,  we  do  not  advocate  the  position 
that  if  something  is  used,  it  must  be  beneficial  to  the  organism.  Such  a 
position  is  untenable  for  many  reasons  (see  Einhorn  G  Hogarth,  1981). 

Second,  we  have  also  argued  that  the  cues  are  partially  redundant  (inter- 
correlated)  and  that  this  affects  the  inference  process.  However,  to  show 
how,  and  to  what  extent  the  cues  are  correlated  would  require  an  elaborate 
(and  problematic)  ecological  analysis  that  is  beyond  the  scope  of  this  paper. 
Nevertheless,  the  assumption  of  correlated  cues  seems  warranted  since  people 
have  strong  expectations  concerning  what  cues  go  together.  Indeed,  just  as  in 
the  perception  of  incomplete  figures  (where  one  fills  in  the  missing  parts), 
scenarios  are  filled  in  by  assuming  that  cues  not  explicitly  mentioned  are 
present.  Thus,  the  fact  that  one  generally  perceives  the  world  as  coherent, 


suggest a  that  the  cues-to-causality  are  redundant  to  some  degree. 

A  third  problematic  issue  raised  by  our  framework  concerns  the  origins  of 
causal  reasoning.  What  are  the  relations  between  the  cues,  schemas,  and  the 
notion  that  people  seem  to  need  to  perceive  the  transfer  of  causal  "energy" 
via  a  link  mechanism  prior  to  making  attributions  of  cause?  As  argued  by  Cook 
and  Campbell  (1979),  amongst  others,  the  need  to  perceive  cause  has  important 
evolutionary  significance  in  environments  where  the  discovery  of  causal 
relations  has  high  survival  value.  Furthermore,  as  noted  earlier,  we  believe 
that  Seligman's  (1970)  analysis  showing  how  both  humans  and  animals  are 
"contra-prepared"  to  learn  certain  types  of  contingencies  does  suggest  some 
genetic  predispositions  toward  certain  kinds  of  causal  attributions.  If  one 
concedes  this  point,  and  accepts  the  fact  that  some  part  of  causal  reasoning 
is  as  yet  "unknowable,"  we  believe  that  the  role  of  the  cues  and  their 
relation  with  schemas  is  fairly  straightforward.  That  is,  the  cues  both 
constrain  the  causal  schemas  people  can  imagine  and,  in  an  important  sense, 
are  the  building  blocks  of  such  schemas.  One  interesting  aspect  of  schema 
construction  lies  in  how  schemas  change  as  people  acquire  greater  knowledge 
over  time.  At  one  extreme  is  the  tendency  to  anthropomorphize  phenomena  one 
does  not  understand.  For  example,  consider  how  small  children  tend  to  ascribe 
personalities  to  the  elements  (e.g.,  the  wind)  so  that  they  can  explain 
changes  in  the  environment  by  attributing  intentions  to  physical  forces  (see 
e.g.,  Piaget,  1974).  Moreover,  note,  as  discussed  earlier  in  reference  to 
social  causation,  that  as  soon  as  one  attributes  intentions,  motives,  and 
goals,  causal  reasoning  is  considerably  facilitated  since  the  constraints 
Implied  by  the  cues-to-causality  are  automatically  respected.  However,  over 
time  such  anthropomorphic  explanations  are  dropped  in  favor  of  accounts  framed 
in  terms  of  physical  causation.  Thus,  mature  adults  are  often  skeptical  of 


explanation*  of  "psychic"  phenomena  unless  they  are  couched  in  terms  of  models 
involving  the  transmission  of  some  form  of  energy.  Indeed,  it  is  perhaps 
paradoxical  that  adult  believers  in  ESP  tend  to  be  found  either  among  the 
highly  credulous  or  scientifically  highly  sophisticated  segments  of  the 
population.  However,  the  bases  of  the  beliefs  in  the  two  groups  are  quite 
different.  That  is,  whereas  the  former  are  prepared  to  believe  in  invisible 
mystical  forces,  the  latter  also  postulate  invisible  forces  but  deny  that 
they  are  mystical. 

(c)  Our  framework  suggests  several  avenues  of  research  concerning  the 
cues  and  how  these  combine  with  the  background  in  affecting  gross  strength. 
First,  whereas  we  believe  that  equation  (7)  is  a  useful  summary  of  the 
existing  literature,  the  functional  form  of  the  equation  both  needs  to  be 
tested  and  suggests  several  explicit  hypotheses.  For  example,  would  the 
necessity  of  the  temporal  order  cue  be  maintained  if  there  were  some  doubt  as 
to  the  relative  timing  of  potential  causa  and  effect?  Can  one  demonstrate 
experimentally  the  effects  of  salience  via  the  background  cue  in  the  hypothe¬ 
sized  manner?  The  hypothesized  form  of  the  link  variable  (Q^)  suggests 
experiments  in  which  one  systematically  varies  both  contiguity  (thus  affecting 
the  number  of  links)  and  covariation  at  the  link  level.  Second,  our  framework 
places  great  emphasis  on  how  the  background  and  cues  constrain  the  construc¬ 
tion  of  causal  scenarios.  However,  it  should  also  be  possible,  within  the 
framework,  to  explore  systematically  factors  that  affect  the  particular 
scenarios  people  select  from  the  "feasible"  sets  of  scenarios.  Different 
levels  of  substantive  expertise,  for  instance,  should  lead  to  scenarios  that 
vary  in  complexity  as  measured  by  the  number  of  links  in  the  causal  chain. 
Experts,  one  will  recall,  view  causal  mechanisms  in  more  reductionistic  terms, 
i.e.,  their  field  differs  from  novices.  Alternatively,  one  could  also  examine 
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the  effect*  of  attention  by  making  some  Information  more  salient  than  others. 
KLnhorn  and  Hogarth  (1982a),  for  example,  found  that  judgments  of  causal 
strength  could  be  affected  by  making  the  cue  of  contiguity  differentially 
salient  in  experimental  stimuli.  However,  they  did  not  explore  the  nature  of 
the  causal  schemas  people  used  in  the  two  conditions.  Moreover,  to  understand 
these  kinds  of  issues  researchers  will  have  to  go  beyond  eliciting  data  via 
judgments  on  rating  scales  but  use  more  data-in tensive  collection  techniques 
such  as  verbal  protocols.  As  revealed  in  our  analysis  of  the  covariation  cue, 
the  manner  in  which  a  causal  question  is  asked  (e.g.,  emphasizing  the 
conditionality  or  multiplicity  of  causation)  could  well  affect  judgments  of 
causal  strength  by  emphasizing  particular  cells  of  2  x  2  covariation  matrices 
at  the  expense  of  others.  Moreover,  one  could  argue  that  this  phenomenon  is 
more  likely  to  occur  in  situations  involving  social  as  opposed  to  physical 
causation.  The  reason  is  that  attributions  of  social  causation  that  use 
concepts  such  as  goals,  motives,  and  intentions  are  particularly  sensitive 
to  the  covariation  cue  and  operate  with  short  links  between  cause  and  effect. 
In  many  situations  involving  physical  causation,  however,  if  the  intermediate 
scenarios  involve  several  links,  the  relative  effect  of  the  covariation  cue 
is  reduced. 

One  intriguing  possibility  centers  on  incorporating  the  cues-to-causality 
in  "expert  systems"  used  in  artificial  intelligence  (Duda  s  Shortliffe,  1983). 
Two  important  problems  faced  by  designers  of  such  systems  are  a  lack  of 
theoretical  structures  for  handling  diagnostic  tasks  and  the  corresponding 
level  of  detailed  substantive  knowledge  that  has  to  be  programmed  in  order  to 
compensate  for  the  lade  of  such  diagnostic  theory.  If  one  considers  the  cues 
as  providing  a  set  of  meta-principles  for  guiding  problem  search,  we  see 
considerable  advantage  in  using  them  in  expert  systems  to  determine  regions 
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of  feasible  solutions.  Moreover,  even  though  we  cannot  demonstrate  that  the 
cues  have  ecological  validity  (sea  above),  we  can  point  to  a  successful 
precedent  of  this  type.  Specifically,  recall  how  successful  "bootstrapping” 
models  have  been  in  the  area  of  clinical  prediction  even  in  the  absence  of 
measurable  criteria  (see  e.g.,  Dawes,  1971;  Camerer,  1981).  At  the  very 
least,  our  suggestions  could  be  used  to  reduce  inconsistency  in  causal 
judgment  (cf.  Hammond,  Rursch  £  Todd,  1964;  Goldberg,  1970). 

Discounting  via  specific  alternatives.  Zn  addition  to  lowering  the 
covariation  between  X  and  Y,  specific  alternatives  point  to  the  possibility  of 
replacing  the  current  explanation  rather  than  simply  discontinuing  it.  That 
is,  since  the  goal  of  causal  inference  is  to  find  some  explanation  for  the 
observed  effects,  the  discounting  of  an  explanation  by  specific  alternatives 
still  leaves  one  with  the  question,  "If  X  didn't  cause  Y,  what  did?"  There¬ 
fore,  while  the  testing  of  hypotheses  via  comparison  with  alternatives  is  part 
of  the  causal  inference  process,  one  is  still  left  with  finding  a  plausible 
explanation.  In  fact,  the  distinction  between  testing  hypotheses  and 
searching  for  better  ones  can  be  likened  to  the  difference  between  a 
"disconfirmation"  vs.  "replacement"  mode  of  inference.  The  replacement  view 
is  consistent  with  the  Kuhnian  notion  that  theories  in  science  are  not 
discarded,  despite  evidence  to  the  contrary,  if  they  are  not  replaced  by 
better  alternatives  (Kuhn,  1962).  Me  believe  that  the  replacement  view  is 
equally  strong  in  everyday  Inference.  A  useful  analogy  might  be  the 
following:  how  many  people  would  read  detective  stories  if  the  author  only 
revealed  who  didn't  do  it?  Moreover,  note  that  replacement  is  more  powerful 
than  disconfirmation  since  the  former  subsumes  the  latter. 

Given  that  specific  alternatives  play  a  central  role  in  the  causal 
Inference  process,  how  do  they  discount  an  explanation  and  even  replace  it? 


Our  earlier  discussion  focused  on  the  former  question  since  we  are  unaware  of 
research  on  the  latter.  However,  although  we  treated  discounting  as  part  of  a 
■ore  general  process  concerned  with  the  updating  of  beliefs,  several  important 
questions  remain.  For  example,  earlier  we  discussed  the  issue  of  the  redun¬ 
dancy  of  explanations  and  the  difficulty  of  incorporating  this  factor  into  the 
discounting  process.  A  further  complication  concerns  how  the  redundancy  of 
explanations  is  affected  by  their  generality.  For  example,  in  decision- 
making- under-uncertainty,  the  expected  utility  model,  (E(U)],  is  a  major 
theory  of  choice  (Schoemaker,  1982).  However,  a  number  of  coupe ting 
theoretical  positions  have  been  developed  in  recent  years  (Kahneman  £  Tversky, 
1979)  Karmarkar,  1979;  Chew  fi  MacCrimmon,  1979;  Bell,  1982).  Since  these 
theories  provide  alternative  explanations  for  choices  amongst  gambles, 
evidence  in  their  favor  should  discount  the  strength  of  E(U).  On  the  other 
hand,  the  competitors  can  be  seen  as  generalizations  of  the  E(u)  model  and 
thus,  evidence  in  their  favor  can  be  viewed  as  providing  some  support  for 
B(0).  Moreover,  since  the  competitors  have  numerous  structural  similarities, 
evidence  supporting  them  may  be  due  to  their  common  rather  than  distinguishing 
features.  Second,  while  we  discussed  the  discounting  process  in  terms  of  a 
sequential  updating  model,  there  are  many  procedural  factors  that  could  affect 
the  amount  and  type  of  discounting  (sea  Lopes,  1982).  For  example,  would  the 
■mount  of  discounting  be  different  if  multiple  alternatives  were  considered  as 
a  unit  rather  than  evaluated  piece-by-piece?;  Does  the  length  and  coop  laxity 
of  ii  formation  affect  attention  such  that  order  effects  occur?  If  so,  under 
what  conditions  is  primacy  more  likely  than  recency,  and  vice  versa?  Third, 
the  updating  of  causal  beliefs  depends  on  positive  as  well  as  negative 
evidence.  Therefore,  a  full  updating  model  will  have  to  deal  with  both  the 
discounting  and  accretion  of  causal  strength. 


CONCLUSION 


Bertrand  Russell  (1948)  stressed  the  uncertain  nature  of  inferences  in 
science  and  common  sense  by  noting  that  they  "differ  from  those  of  deductive 
logic  and  mathematics  in  a  very  important  sense,  namely,  that  when  the 
premises  are  true  and  the  reasoning  correct,  the  conclusion  is  only  probable" 
(1948,  p.  335,  emphasis  in  original).  Since  causal  inference  is  an  essential 
and  ubiquitous  cognitive  activity,  we  have  focused  on  research  concerning  how 
people  make  judgments  of  probable  cause.  In  providing  both  a  conceptual 
framework  and  review  of  the  relevant  literature,  we  have  tried  to  provide 
links  between  the  extensive  literature  on  causation  and  the  burgeoning 
psychological  literature  on  judgment-under-uncertainty.  In  attempting  this 
integration,  we  have  argued  that  judgments  of  probable  cause  are  affected  by 
three  main  factors:  a  causal  background  or  field,  probabilistic  cues-to- 
causality,  and  a  discounting  process  for  specific  alternative  explanations. 
Moreover,  these  general  ideas  can  be  summarized  by  a  perceptual  analogy  in 
which  causal  candidates  are  differences -in-a-background  (figures  are  seen 
against  ground),  good  explanations  arise  from  internally  consistent  patterns 
of  cues  (good  figures  form  a  "gestalt”),  and  good  explanations  have  few 
plausible  alternatives  (as  do  good  figures). 

Whereas  our  framework  accounts  for  much  literature  and  leads  to  many 
testable  implications,  it  by  no  means  explicates  all  aspects  of  causal 
reasoning.  In  particular,  inferences  made  on  the  basis  of  complex  scenarios, 
defining  and  measuring  the  "coherence"  of  a  causal  explanation  (Trabasso  et 
al. ,  1984),  issues  of  multiple  and  redundant  causation,  etc.,  present 
formidable  difficulties  and  challenges  for  psychological  research.  However, 
given  the  complexity  of  these  issues,  it  seems  appropriate  to  have  started 
with  a  model  based  on  alternatives,  background,  and  cues;  i.e.,  the  ABC  of 


causal  inference 
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1The  assumption  that  X  +  X  +  X  ■  1  implies  that  gross  strength  is 

L  *1  4 

made  up  of  two  major  parts:  a  multiplicative  component  *  QL)  and 

a  weighted  average  (X^Q^  +  X^  +  X^Q' ) .  Furthermore,  if  0  <  Q'  <  1,  the 
gross  strength  of  X  for  Y  will  be  appropriately  bounded. between  0  and  1;  i.e. 


0  <  s(X,  Y)  «  1. 
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